IN THE UNITED STATES PATENT Alto TRADEMARK OFFICE 

Request for filing new utility patent application under 37 cfr 1.53(b) 

" (P043D2C2) 


Sir: 

Transmitted herewith for filing under 35 U.S.C. 111(a) and 37 
C.F.R. §1.53 (b) is a new utility patent application for: 

Title: INTEGRATED CIRCUIT I/O USING A 
HIGH PERFORMANCE BUS INTERFACE 


o 

H 
Oh 


* 

to the Assistant Commission for Patents D 
Washington, D.C. 20231 


U 


Inventors : Michael Farmwald 

Mark Horowitz 


This application is a CONTINUATION APPLICATION of: 


Inventors: Michael Farmwald 

Mark Horowitz 

Ser. No.: 08/798,520 Art Unit: 2511 

Filed: February 10, 1997 Examiner: T. Nguyen 

Title: INTEGRATED CIRCUIT I/O USING A 
HIGH PERFORMANCE BUS INTERFACE 


To effect the above-requested filing today: 

1. Attached is a copy of the prior application as originally 
filed, including : 

[X] Specification, Claims, and Abstract (125 pages) 

[X] Drawings: one (1) set of formal drawings having 14 sheets 

[X] One (1) Executed Declaration and Power of Attorney 


2. Incorporation by Reference: The entire disclosure of the 
prior application, from which a copy of the oath or 
declaration is supplied, is considered as being part of the 
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disclosure of the accompanying application and is hereby 
incorporated by reference therein. 


3. A Power of Attorney to Neil A. Steinberg, Reg. No. 34,735 is 
attached. That Power of Attorney revokes all other powers 
of attorney. The current address of Neil A. Steinberg is as 
in item 4 . 

4. Address all future communications to: 


Neil A. Steinberg 
5827 Osceola Road 
Bethesda, Maryland 2 0816 
Telephone : 301-229-7706 
Facsimile : 301-22 9-5882 


5. The Examiner f s attention is directed to both the second 
paragraph of guideline (2) in MPEP 609 and to the last 
paragraph of MPEP 2001.06(b) and to the submission in the 
prior application of the Information Disclosure Statement and 
document copies filed in Application Serial No. 08/798,520. 


6. Attached is a PRELIMINARY AMENDMENT which, among other 

things, cancels claims 1-150 and adds new claims 151-172. 
This Preliminary Amendment is to be entered BEFORE fee 

calculation. 


7 . FILING FEE 

(BASED ON THE NUMBER OF CLAIMS AS FILED AND CHANGED BY PRELIMINARY AMENDMENT) 

Basic Fee $ 760.00 

Additional Fees: 

Surcharge for more than 20 total claims (2 * $18 ) .... $ 36.00 

Surcharge for more than 3 independent claims {0 additional) £ - 0 - 

Surcharge for multiple dependent claims $ - 0 - 

Total Filing Fee $ 796.00 
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8 . Manner of Payment : 

[XX] A check payable to the Commissioner of Patents and 
Trademarks, in the amount of $796 .00 is enclosed as 
payment of the Total Filing Fee. 

[XX] The Commissioner is hereby authorized to charge any fees 
which may be required, or credit any overpayment to 
Deposit Account No. 50-0763 . A duplicate copy of this 
sheet is enclosed. 

[ ] The Commissioner is hereby authorized to charge any fees 
which may be required, or credit any overpayment to 

Deposit Account No. . A duplicate copy of this 

sheet is enclosed. 



Respectfully submitted, 


Date: November 20, 1998 


Neil A. Steinberg 
Reg. No. 34,735 
202-887-5662 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

(Case No. P043D2C2) 


In the Application of: 



} w J- w u^/ 


) Art Unit: 

Serial No: Continuation of 08/798,520 



) Before 

Filed: NOVEMBER 20, 1998 

) Examiner : 

Title: INTEGRATED CIRCUIT I/O USING A 


HIGH PERFORMANCE BUS INTERFACE 



Assistant Commissioner for Patents 
Washington, DC 20231 

PRELIMINARY AMENDMENT 

Dear Sir: 

Prior to the examination of the above -referenced application, 
kindly amend the application as follows: 

IN THE TITLE : 

Please delete the title and substitute --SYNCHRONOUS MEMORY 

DEVICE HAVING A DELAY TIME REGISTER AND METHOD OF OPERATING SAME--. 
IN THE SPECIFICATION : 

On page 3, line 9, delete "micro-processor" and substitute 
- -microprocessor- - . 

On page 6, line 1, delete "4,646,279" and substitute 
--4, 646, 270-- . 

On page 10, line 18, delete "Figure 7 shows" and substitute 
--Figures 7a and 7b show--. 


On page 10, line 21 , delete "Figure 8 shows" and substitute -- 
Figures 8a and 8b show--. 

On page 34, line 4, after "devices" insert --do--. 

On page 41, line 1, delete "or* "and substitute -- or--. 

On page 45, line 17, delete "Fig. 7" and substitute --Figures 
7a and 7b- - . 

On page 47, line 2, delete "Figure 8" and substitute 
- -Figure 8a- - . 

On page 47, line 5, delete "from left to right" and substitute 
-- from right to left--. 

On page 47, line 8, delete "right" and substitute --left--. 

On page 47, line 9, delete the first "left" and substitute 
- -right- - . 

On page 49, line 22, delete "primay" and substitute 
- -primary- - . 

On page 56, line 2, delete "Figurell" and substitute 
- -Figure 11- - . 

On page 60, line 10, after "147" insert --A, B--. 


IN THE CLAIMS : 

Kindly cancel claims 1-150, without prejudice. 


Kindly add the following claims : 

151. A synchronous semiconductor memory device having at least 
one memory section including a plurality of memory cells, the 
memory device comprising: 

clock receiver circuitry to receive an external clock signal; 
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a register which stores a value which is representative of a 
delay time after which the memory device responds to a read 
request ; and 

a plurality of output drivers to output data in accordance 
with the delay time and synchronously with respect to the external 
clock signal. 

152. The memory device of claim 151 further including: 
clock generation circuitry, coupled to the clock receiver 

circuitry, to generate at least one internal clock signal; and 

wherein the plurality of output drivers output data in 
response to the internal clock signal . 

153 . The memory device of claim 152 wherein the plurality of 
output drivers output data in response to a rising edge of the 
internal clock signal . 

154. The memory device of claim 151 further including: 

a delay locked loop, coupled to the clock receiver circuitry, 
to generate an internal clock signal using at least the external 
clock signal; and 

wherein the plurality of output drivers output data in 
response to the internal clock signal . 

155 . The memory device of claim 151 wherein the value which is 
representative of the delay time is stored in the register after 
power is applied to the device* 
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1 156. The memory device of claim 151 wherein the value stored 

2 in the register is representative of one of a plurality of 

3 different delay times. 

1 157. A synchronous semiconductor memory device having at least 

2 one memory section including a plurality of memory cells, the 

3 memory device comprising: 

4 clock receiver circuitry to receive an external clock signal; 

5 at least one register to store a value which is representative 

6 of a delay time; and 

7 wherein in response to a read request, the memory device 
tfl 8 outputs data in accordance with the delay time and synchronously 
v3 9 with respect to the external clock signal . 

vy 1 158. The memory device of claim 157 further including: 

ju 2 clock generation circuitry, coupled to the clock receiver 

flj 3 circuitry, to generate an internal clock signal; and 

1% 4 an output driver, coupled to the internal clock generation 

"™ 5 circuitry, to output the data in response to the internal clock 

6 signal. 

1 159. The memory device of claim 158 wherein the output driver 

2 outputs data in response to a rising edge of the internal clock 

3 signal. 

1 160. The memory device of claim 157 further including a delay 

2 locked loop, coupled to the clock receiver circuitry, to generate 
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an internal clock signal using at least the external clock signal 
and wherein the output driver outputs data in response to the 
internal clock signal . 


1 161. The memory device of claim 157 wherein the memory device, 

2 in response to a set register request, stores a value in the at 

3 least one register. 


1 162. The memory device of claim 157 wherein the value stored 

2 in the register is representative of one of a plurality of 

3 available delay times. 


%u i 163. A method of operating a synchronous semiconductor memory 

M 2 device having at least one memory section including a plurality of 

3 memory cells and a register for storing a value which is 

m 4 representative of a time delay after which the memory device 

fjj 5 responds to a read request, the method comprising: 

6 issuing a read request to the memory device wherein the memory 

7 device, in response to the read request, outputs data on a bus in 

8 accordance with the time delay and synchronously with respect to an 

9 external clock signal. 


1 164. The method of claim 163 further including issuing a set 

2 register request, wherein, in response to the set register request, 

3 the memory device stores the value in the register. 
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165. The method of claim 164 wherein the set register request 
and the value are provided to the memory device in a single request 
packet . 

166. The method of claim 163 wherein the read request and 
information identifying the time delay after which the memory 
device responds to a read request are provided to the memory device 
in a single request packet. 

167. The method of claim 163 further including the steps of: 
initializing the register in the memory device by issuing a 

set register request on the bus; and 

providing the value which is representative of the time delay. 

168. The method of claim 163 further including the step of 
identifying the memory device on the bus using a device 
identification code . 

169. The method of claim 168 wherein the device identification 
code and information identifying the time delay after which the 
memory device responds to a read request are provided to the memory 
device in a single request packet. 

170. The method of claim 168 wherein the device identification 
code is a unique device code. 
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171. The method of claim 168 wherein the device identification 


2 


code is a non-unique device code. 


1 


172. The method of claim 163 wherein the value stored in the 


2 


register is one of a plurality of available delay times. 


REMARKS 


This Preliminary Amendment seeks to place this application in 
condition for allowance. The instant application is a continuation 
of Application Serial No. 08/798,520. Application Serial No. 
08/798,520 has been allowed; the issue fee has been paid; and it 


will issue shortly. 

In this continuation application, Applicants present new 
claims which set forth novel and unobvious features of Applicants' 
invention. Applicants submit new claims 151-172 to more fully 
protect the instant invention. No new matter has been added. 


These newly submitted claims are believed to be fully 
supported by the specification as originally filed see, for 
example, Figures 2 and 10-13; page 14, line 3 to page 15, line 2; 
page 15, lines 18 to page 16, line 7; page 20, line 20 to page 21, 
line 20; page 23, line 6 to page 24, line 2; page 46, line 19 to 
page 48, line 17; and page 53, line 23 to page 59, line 2. 

In addition, Applicants have amended the specification to 
correct obvious spelling, typographical and grammatical errors. No 
new matter has been added. 
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Finally, accompanying this Preliminary Amendment is a Request 
to Approve Drawing Changes. In that Request, Applicants seek to 
amend Figure 10 to more fully reflect the discussion in the 
specification, in particular, page 55, lines 12-16 and page 58, 
lines 13-23. The proposed changes are indicated in red. No new 
matter has been added. Applicants respectfully request that the 
Examiner approve the proposed changes to Figure 10. A new Figure 
10 which incorporates the changes is also attached to the Request. 


Applicants respectfully request entry of the foregoing 
amendment prior to examination of this continuation application. 
Applicants submit that all of the claims present patentable subject 
matter which definitely set forth the novel and unobvious features 
of Applicants' invention. Accordingly, Applicants respectfully 
request allowance of all of the claims. 


CONCLUSION 


Respectfully submitted, 


Date: November 20, 1998 


id 



Neil A. Steinberg 
Reg. No. 34,735 
301-229-7706 
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Our Reft RamBus D-l 


Integrated Circuit I/O Using A 
5 High Performance Bus Interface 

FIELD OF THE INVENTION 

10 An Integrated circuit bus Interface for computer and 

video systems is described which allows high speed transfer of 
blocks of data, particularly to and from memory devices, with 
reduced power consumption and increased system reliability. A 

C3 new method of physically implementing the bus architecture is 

145 also described. 


,0 BACKGROUND OF THE INVENTION 

- Semi conductor computer memories have traditionally been 

L designed and structured to use one memory device for each bit, or 

}4o s&all group of bits, of any individual computer word, where the 

'ft 

word size is governed by the choice of computer. Typical word 
sizes range from 4 to 64 bits. Each memory device typically is 
connected in parallel to a series of address lines and connected 
to one of a series of data lines. When the computer seeks to 
25 read from or write to a specific memory location, an address is 
put on the address lines and some or all of the memory devices 
are activated using a separate device select line for each needed 

device* One or more devices may be connected to each data line 

- - - * - » 

but typically only a small number of data lines are connected to 
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a single memory device. Thus data line 0 is connected to 
device(s) 0, data line 1 is connected to device (s) 1, and so on. 
Data is thus accessed or provided in parallel for each memory 
read or write operation. For the system to operate properly, 
every single memory bit in every memory device must operate 
dependably and correctly. 

To understand the concept of the present invention, it 
is helpful to review the architecture of conventional memory 
devices. Internal to nearly all types of memory devices 
(including the most widely used Dynamic Random Access Memory 
(DRAM), Static RAM (SRAM) and Read Only Memory (ROM) devices), a 
large number of bits are accessed in parallel each time the 
system carries out a memory access cycle. However, only a small 
percentage of accessed bits which are available internally each 
time the memory device is cycled ever make it across the device 
boundary to the external world. 

Referring to Pig. 1, all modern DRAM, SRAM and ROM 
designs have internal architectures with row (word) lines 5 and 
column (bit) lines 6 to allow the memory cells to tile a two 
dimensional area 1* One bit of data is stored at the 
intersection of each word and bit line. When a particular word 
line is enabled, all of the corresponding data bits are 
transferred onto the bit lines. Some prior art DRAMs take 
advantage of this organisation to reduce the number of pins 
needed to transmit the address. The address of a given memory 
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cell is split into two addresses, row and column, each of which 
can be multiplexed over a bus only half as wide as the memory 
cell address of the prior art would have required. 

COMPARISON WITH PRIOR ART 

Prior art memory systems have attempted to solve the 
problem of high speed access to memory with limited success. 
U.S. Patent No. 3,821,715 (Hoff et. al.), was issued to Intel 
Corporation for the earliest 4-bit micro-processor. That patent 
describes a bus connecting a single central processing unit (CPU) 
with multiple RAMs and ROMs. That bus multiplexes addresses and 
data over a 4-bit wide bus and uses point-to-point control 
signals to select particular RAMs or ROMs. The access time is 
fixed and only a single processing element is permitted. There 
is no block-mode type of operation, and most important, not all 
of the interface signals between the devices are bused (the ROM 
and RAM control lines and the RAM select lines are point-to- 
point). 

In U.S. Patent No. 4,315,308 (Jackson), a bus 
connecting a single CPU to a bus interface unit is described. 
The invention uses multiplexed address, data, and control 
information over a single 16-bit wide bus. Block-mode operations 
are defined, with the length of the block sent as part of the 
control sequence. In addition, variable access-time operations 
using a "stretch" cycle signal are provided. There are no 
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multiple processing elements and no capability for mltiple 
outstanding requests, and again, not all of the interface signals 
are bused. 

In U.S. Patent Mo. 4,449,207 (Kung, et. al.), & DRAM is 
5 described which multiplexes address and data on an internal bus. 
The external interface to this DRAM is conventional, with 
separate control, address and data connections. 

In U.S. Patent Nos. 4,764,846 and 4,706,166 (Go), a 3-D 
package arrangement of stacked die with connections along a 
|0 single edge is described. Such packages are difficult to use 
[q because of the point-to-point wiring required to interconnect 
1 conventional memory devices with processing elements. Both 

patents describe complex schemes for solving these problems. No 
j\ attempt is made to solve the problem by changing the interface. 
JJ5 In U.S. Patent Ho. 3,969,706 (Proebsting, et. the 

s 2 current state-of-the-art DRAM interface is described. The 

r* « ■» 

address is two-way multiplexed, and there are separate pins for 
data and control (RAS, CAS, WE, CS). The number of pins grows 
with the sire of the DRAM', and many of the connections must be 

20 made point-to-point in a memory system using such DRAMs. 

There are many backplane buses described in the prior 
art, but not in the combination described or having the features 
of this invention. Many backplane buses multiplex addresses and 
data on a single bus (e.g., the HU bus). ELXSI and others have 

25 implemented split-transaction buses (U.S. Patent No* 4,595,923 
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and 4,481,625 (Roberts)). ELXSI has also Implemented a 
relatively low-voltage- swing current-mode ECL driver 
(approximately 1 V swing). Address-space registers are 
implemented on most backplane buses, as is some form of block 
5 mode operation. 

Nearly all modem backplane buses implement some type 
of arbitration scheme, but the arbitration scheme used in this 
invention differs from each of these ♦ U.S. Patent Nos. 4,837,682 
(Culler), 4,818,985 (Ikeda), 4,779,089 (Theus) and 4,745,548 
10 (Blahut) describe prior art schemes. All involve either log K 
y3 extra signals, (Theus, Blahut), where N is the number of 
I- potential bus requestors, or additional delay to get control of 
v3 the bus (Ikeda, Culler). None of the buses described in patents 
jU or other literature use only bused connections. All contain some 
}|j5 point-to-point connections on the backplane. None of the other 
aspects of this invention such as power reduction by fetching 
each data block from a single device or compact and low-cost 3-D 
packaging even apply to backplane buses. 

The clocking scheme used in this invention has not been 
20 used before and in fact would be difficult to implement in 

backplane buses due to the signal degradation caused by connector 
stubs. U.S. Patent No. 4,247,817 (Heller) describes a clocking 
scheme using two clock lines, but relies on ramp-shaped clock 
signals in contrast to the normal rise-time signals used in the 
25 present invention. 
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In U.S. Patent No. 4,646,279 (Vobs), a video RAM is 
described which implements a parallel-load, serial-oat shift 
register on the output of a DRAM. This generally allows greatly 
improved bandwidth (and has been extended to 2, 4 and greater 
width shift-out paths.) The rest of the interfaces to the DRAM 
(RAS, CAS, multiplexed address, etc.) remain the same as for 
conventional DRAMS. 

One object of the present invention is to use a new bus 
interface built into semiconductor devices to support high-speed 
access to large blocks of data from a single memory device by an 
external user of the data, such as a microprocessor, in an 
efficient and cost-effective manner. 

Another object of this invention is to provide a^- 
clocking scheme to permit high speed clock signals to be sent 
along the bus with minimal clock skew between devices. 

Another object of this invention is to allow mapping 
out defective memory devices or portions of memory devices. 

Another object of this invention is to provide a method 
for distinguishing otherwise identical devices by assigning a 
unique identifier to each device. 

Yet another object of this invention is to provide a 
method for transferring address, data and control information 
over a relatively narrow bus and to provide a method of bus 
arbitration when multiple devices seek to use the bus 
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Another object of this Invention is to provide a method 
of distributing a high-speed memory cache within the DRAM chips 
of a memory system which is much more effective than previous 
cache methods. 

Another object of this invention is to provide devices, 
especially DRAMs, suitable for use with the bus architecture of 
the invention* 

SUMMARY OF INVENTION 

The present invention includes a memory subsystem 
comprising at least two semiconductor devices, including at least - 
one memory device, connected in parallel to a bus, where the bus 
includes a plurality of bus lines for carrying substantially all 
address, data and control information needed by said memory 
devices, where the control information includes device-select 
information and the bus has substantially fewer bus lines than 
the number of bits in a single address, and the bus carries 
device- select information without the need for separate device- 
select lines connected directly to individual devices* 

Referring to Fig* 2, a standard DRAM 13, 14, ROM (or 
SRAM) 12, microprocessor CPU 11, I/O device, disk controller or 
other special purpose device such as a high speed switch is 
modified to use a wholly bus-based interface rather than the 
prior art combination of point-to-point and bus-based wiring used 
with conventional versions of these devices. The new bus 
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includes clock signals, power and multiplexed address, data and 
control signals* Zn a preferred implementation, 8 bus data lines 
and an AddressValid bus line carry address, data and control 
information for memory addresses up to 40 bits vide. Persons 
5 skilled in the art will recognise that 16 bus data lines or other 
numbers of bus data lines can be used to implement the teaching 
of this invention. The new bus is used to connect elements such 
as memory, peripheral, switch and processing units* 

In the system of this invention, DRAKs and other 
jlLO devices receive address and control information over the bus and 
transmit or receive requested data over the same bus* Each 

-»+■*■ 

t* wr 

M memory device contains only a single bus interface with no other 

M3 signal pins. Other devices that may be included in the system 

h can connect to the bus and other non-bus lines, such as 

f|5 input/output lines. The bus supports large data block transfers 

tip 0*r| 
(■ " 

2 and split transactions to allow a user to achieve high bus 

I*" " « 

utilisation. This ability to rapidly read or write a large block 
of data to one single device at a time is an important advantage 
of this invention. 

20 The DRAKs that connect to this bus differ from 

conventional DRAKs in a number of ways. Registers are provided 
which may store control information, device identification, 
device-type and other information appropriate for the chip such 
as the address range for each independent portion of the device. 

25 New bus interface circuits must be added and the internals of 
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prior art DRAM devices need to be modified eo they can provide 
and accept data to and from the bus at the peak data rate of the 
bus. This requires changes to the column access circuitry in the 
DRAM, with only a minimal increase in die size. A circuit is 
provided to generate a low skew internal device clock for devices 
on the bus, and other circuits provide for demultiplexing input 
and multiplexing output signals. 

High bus bandwidth is achieved by running the bus at a 
very high clock rate {hundreds of MHz) . This high clock rate is 
made possible by the constrained environment of the bus. The bus 
lines are controlled-impedance, doubly- terminated lines. For a 
data rate of 500 MHz, the maximum bus propagation time is less 
than 1 ns (the physical bus length is about 10 cm). In addition, 
because of the packaging used, the pitch of the pins can be very 
close to the pitch of the pads. The loading on the bus resulting 
from the individual devices is very small. In a preferred 
implementation, this generally allows stub capacitances of 1-2 pF 
and inductances of 0.5 - 2 nH. Each device 15 , 16 , 17, shown in 
Figure 3, only has pins on one side and these pins connect 
directly to the bus 18 . A transceiver device 19 can be included 
to interface multiple units to a higher order bus through 
pins 20. 

A primary result of the architecture of this invention 
is to increase the bandwidth of DRAM access. The invention also 
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reduces manufacturing and production costs, power consumption # 
and increases packing density and system reliability* 

BRIEF DESCRIPTION OP THE DRAWINGS 

5 Figure 1 is a diagram which illustrates the basic 2-D 

organization of memory devices. 

Figure 2 is a schematic block diagram which illustrates 
the parallel connection of all bus lines and the serial Reset 
line to each device in the system. 
W Figure 3 is a perspective view of a system of the 

;S invention which illustrates the 3-D packaging of semiconductor 
!\ devices on the primary bus. 

Figure 4 shows the format of a request packet. 
Figure 5 shows the format of a retry response from a 

15 slave. 

Q Figure 6 shows the bus cycles after a request packet 

CO collision occurs on the bus and how arbitration is handled. 

Figure 7 shows the timing whereby signals from two 
devices can overlap temporarily and drive the bus at the same 
20 time . 

Figure 8 shows the connection and timing between bus 
clocks and devices on the bus. 

Figure 9 is a perspective view showing how transceivers 
can be used to connect a number of bus units to a transceiver 
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bus. Figure 10 is a block and schematic diagram of 

input/output circuitry used to connect devices to the bus* 

Figure 11 is a schematic diagram of a clocked sense- 
amplifier used as a bus input receiver* 

Figure 12 is a block diagram showing how the internal 
device clock is generated from two bus clock signals using a set 
of adjustable delay lines. 

Figure 13 is a timing diagram showing the relationship 
of signals in the block diagram of Figure 12. 

Figure 14 is timing diagram of a preferred means of 
implementing the reset procedure of this invention. 

Figure 15 is a diagram illustrating the general 
organization of a 4 Mbit DRAM divided into 8 subarrays. 

DETAILED DESCRIPTION 

The present invention is designed to provide a high 
speed, multiplexed bus for communication between processing 
devices and memory devices and to provide devices adapted for use 
in the bus system. The invention can also be used to connect 
processing devices and other devices, such as I/O interfaces or 
disk controllers, with or without memory devices on the bus. The 
bus consists of a relatively small number of lines connected in 
parallel to each device on the bus. The bus carries 
substantially all address, data and control information needed by 
devices for communication with other devices on the bus. In many 

4 
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systems using the present Invention, the bus carries almost every 
signal between every device in the entire system. Share is no 
need for separate device-select lines since device-select 
information for each device on the bus is carried over the bus. 
There is no need for separate address and data lines because 
address and data information can be sent over the same lines. 
Using the organization described herein, very large addresses (40 
bits in the preferred implementation) and large data blocks (1024 
bytes) can be sent over a small number of bus lines (8 plus one 
control line in the preferred implementation). 

Virtually all of the signals needed by a computer" 
system can be sent over the bus. Persons skilled in the art 
recognise that certain devices, such as CPUs, may be connected to 
other signal lines and possibly to independent buses, for example 
a bus to an Independent cache memory, in addition to the bus of 
this invention. Certain devices, for example cross-point 
switches, could be connected to multiple, independent buses of 
this invention. In the preferred implementation, memory devices 
are provided that have no connections other than the bus 
connections described herein and CPUs are provided that use the 
bus of this invention as the principal, if not exclusive, 
connection to memory and to other devices on the bus. 

All modern DRAM, SRAM and ROM designs have internal 
architectures with row (word) and column (bit) lines to 
efficiently tile a 2-D area. Referring to Pig. 1, one bit of 
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data it stored at the intersection of each word line 5 and bit 
line 6* Hhen a particular word line is enabled, all of the 
corresponding data bits are transferred onto the bit lines. This 
data, about 4000 bits at a time in a 4 MBit DRAM, is then loaded 
into column sense amplifiers 3 and held for use by the I/O 
circuits . 

In the invention presented here, the data from the 
sense amplifiers is enabled 32 bits at a time onto an internal 
device bus running at approximately 125 MHz. This internal 
device bus moves the data to the periphery of the devices where 
the data is multiplexed into an 8 -bit wide external bus 
interface, running at approximately 500 MHz. 

The bus architecture of this invention connects master 
or bus controller devices, 6uch as CPUs, Direct Memory Access 
devices (DMAs) or Floating Point Units (FPUs) , and slave devices, 
such as DRAM, SRAM or ROM memory devices. A slave device 
responds to control signals; a master sends control signals. 
Persons skilled in the art realize that some devices may behave 
as both master and slave at various times, depending on the mode 
of operation and the state of the system. For example, a memory 
device will typically have only slave functions, while a DMA 
controller, disk controller or CPU may include both slave and 
master functions. Many other semiconductor devices, including 
I/O devices, disk controllers, or other special purpose devices 
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•uch as high speed switches can be modified for use with the bus 
of this invention. 

Each semiconductor device contains a set of internal 
registers, preferably including a device identification (device 
ID) register, a device-type descriptor register, control 
registers and other registers containing other information 
relevant to that type of device. In a preferred implementation, 
semiconductor devices connected to the bus contain registers 
which specify the memory addresses contained within that device 
and access-time registers which store a set of one or more delay 
times at which the device can or should be available to send or 
receive data. 

Most of these registers can be modified and preferably 
are set as part of an initialization sequence that occurs when 
the system is powered up or reset. During the initialization 
sequence each device on the bus is assigned a unique device ID 
number, which is stored in the device ID register, A bus master 
can then use these device ID numbers to access and set 
appropriate registers in other devices, including access-time 
registers, control registers, and memory registers, to configure 
the system. Each slave may have one or several access-time 
registers (four in a preferred embodiment). In a preferred 
embodiment, one access-time register in each slave is permanently 
or semi -permanently programmed with a fixed value to facilitate 

High Performance Bus Interface -14- 


certain control functions . A preferred implementation of an 
initialisation sequence is described below in more detail. 

All information sent between master devices and slave 
devices is sent over the external bus, which, for example, may be 
8 bits wide. This is accomplished by defining a protocol whereby 
a master device, such as a microprocessor, seizes exclusive 
control of the external bus (i.e., becomes the bus master) and 
initiates a bus transaction by sending a request packet (a 
sequence of bytes comprising address and control information) to 
one or more slave devices on the bus. An address can consist of 
16 to 40 or more bits according to the teachings of this 
invention. Each slave on the bus must decode the request packet 
to see if that slave needs to respond to the packet. The slave 

+ 

that the packet is directed to must then begin any internal 
processes needed to carry out the requested bus transaction at 
the requested time. The requesting master may also need to 
transact certain internal processes before the bus transaction 
begins. After a specified access time the slave (s) respond by 
returning one or more bytes (8 bits) of data or by storing 
information made available from the bus. More than one access 
time can be provided to allow different types of responses to 
occur at different times. 

A request packet and the corresponding bus access are 
separated by a selected number of bus cycles, allowing the bus to 
be used in the intervening bus cycles by the same or other 
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rasters for additional requests or brief bus accesses* Thus 
multiple, independent accesses are permitted , allowing maximum 
utilisation of the bus for transfer of short blocks of data. 
Transfers of long blocks of data use the bus efficiently even 
without overlap because the overhead due to bus address, control 
and access times is small compared to the total time to request 
and transfer the block. 

Device Address Mapping 

Another unique aspect of this invention is that each 
memory device is a complete, independent memory subsystem with 
all the functionality of a prior art memory board in a 
conventional backplane-bus computer system. Individual memory 
devices may contain a single memory section or may be subdivided 
into more than one discrete memory section. Memory devices 
preferably include memory address registers for each discrete 
memory section. A failed memory device (or even a subsection of 
a device) can be "mapped out" with only the loss of a small 
fraction of the memory, maintaining essentially full system 
capability . Mapping out bad devices cam be accomplished in two 
ways, both compatible with this invention. 

The preferred method uses address registers in each 
memory device (or independent discrete portion thereof) to store 
information which defines the range of bus addresses to which 
this, memory device will respond. This is similar to prior art 

■ 
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schemes used In memory boards in conventional backplane bus 
systems. The address registers can include a single pointer, 
usually pointing to a block of known sise, a pointer and a fixed 
or variable block size value or two pointers, one pointing to the 
5 beginning and one to the end (or to the "top" and "bottom") of 
each memory block* By appropriate settings of the address 
registers, a series of functional memory devices or discrete 
memory sections can be made to respond to a contiguous range of 
addresses, giving the system access to a contiguous block of good 
Jo memory, limited primarily by the number of good devices connected 
[q to the bus. A block of memory in a first memory device or memory 
ju section can be assigned a certain range of addresses, then a 
In block of memory in a next memory device or memory section can be 
assigned addresses starting with an address one higher (or lower, 
15 depending on the memory structure) than the last address of the 

m-t wh« 

% i previous block. 

W Preferred devices for use in this invention include 

device- type register information specifying the type of chip, 
including how much memory is available in what configuration on 

20 that device. A master can perform an appropriate memory test, 
such as reading and writing each memory cell in one or more 
selected orders, to test proper functioning of each accessible 
discrete portion of memory (based in part on information like 
device ID number and device-type) and write address values (up to 

25 40 bits in the preferred embodiment, 10 12 bytes), preferably 
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contiguous. Into device address -space registers. Non-functional 
or impaired memory sections can be assigned a special address 
value which the system can interpret to avoid using that memory. 

The second approach puts the burden of avoiding the bad 
devices on the system master or masters. CPUs and DMA 
controllers typically have some sort of translation look-aside 
buffers (TLBs) vhich map virtual to physical (bus) addresses. 
With relatively simple software, the TLBs can be programmed to 
use only working memory (data structures describing functional 
memories are easily generated). For masters which don't contain 
TLBs (for example, a video display generator), a small, simple 
RAM can be used to map a contiguous range of addresses onto the 
addresses of the functional memory devices. 

Either scheme works and permits a system to have a 
significant percentage of non-functional devices and still 
continue to operate with the memory which remains. This means 
that systems built with this invention will have much improved 
reliability over existing systems, including the ability to build 
systems with almost no field failures. 

Bus 

The preferred bus architecture of this invention 
comprises 11 signals: BusData[0»7] j AddrValid; Clkl and Clk2; 
plus an input reference level and power and ground lines 
connected in parallel to each device. Signals are driven onto 
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the bus during conventional bus cycles. The notation 
*Signal[isj] • refers to a specific range of signals or lines, for 
example, BusData[0t7] means BusDataO, BusDatal, ♦ . BusData7. 
The bus lines for BusData[0t7] signals form a byte -wide, 
multiplexed data/address /control bus. AddrValid is used to 
indicate when the bus is holding a valid address request, and 
instructs a slave to decode the bus data as an address and, if 
the address is Included on that slave, to handle the pending 
request. The two clocks together provide a synchronised, high 
speed clock for all the devices on the bus. In addition to the 
bused signals, there is one other line (Resetln, ResetOut) 
connecting each device in series for use during initialization to 
assign every device in the system a unique device ID number 
(described below in detail). 

To facilitate the extremely high data rate of this 
external bus relative to the gate delays of the internal logic, 
the bus cycles are grouped into pairs of even/odd cycles* Note 
that all devices connected to a bus should preferably use the 
same even/odd labeling of bus cycles and preferably should begin 
operations on even cycles. This is enforced by the clocking 
scheme. 

Protocol and Bus Operation 

The bus uses a relatively simple, synchronous, split- 
transaction, block-oriented protocol for bus transactions. One 
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of the goals of the system is to keep the intelligence 
concentrated in the masters, thus keeping the slaves as simple as 
possible (since there are typically many more slaves than 
masters). To reduce the complexity of the slaves, a slave should 
preferably respond to a request in a specified time, sufficient 
to allow the slave to begin or possibly complete a device- 
internal phase including any internal actions that must precede 
the subsequent bus access phase. The time for this bus access 
phase is known to all devices on the bus - each master being 
responsible for making sure that the bus will be free when the 
bus access begins. Thus the slaves never worry about arbitrating 
for the bus. This approach eliminates arbitration in single 
master systems, and also makes the slave-bus interface simpler. 

In a preferred implementation of the invention, to 
initiate a bus transfer over the bus, a master sends out a 
request packet, a contiguous series of bytes containing address 
and control information. It is preferable to use a request 
packet containing an even number of bytes and also preferable to 
start each packet on an even bus cycle. 

The device-select function is handled using the bus 
data lines. AddrValid is driven, which instructs all slaves to 
decode the request packet address, determine whether they contain 
the requested address, and if they do, provide the data back to 
the master (in the case of a read request) or accept data from 
the master (in the case of a write request) in a data block 
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transfer. A master can also select a specific device by 
transmitting a device ID number in a request packet* In a 
preferred implementation, a special device ID number is chosen to 
indicate that the packet should be interpreted by all devices on 
the bus* This allows a master to broadcast a message, for 
example to set a selected control register of all devices with 
the same value. 

The data block transfer occurs later at a time 
specified in the request packet control Information, preferably 
beginning on an even cycle* A device begins a data block 
transfer almost immediately with a device-internal phase as the 
device initiates certain functions, such as setting up memory 
addressing, before the bus access phase begins. The time after 
which a data block is driven onto the bus lines is selected from 
values stored in slave access-time registers* The timing of data 
for reads and writes is preferably the same; the only difference 
is which device drives the bus. For reads, the slave drives the 
bus and the master latches the values from the bus. For writes 
the master drives the bus and the selected slave latches the 
values from the bus. 

In a preferred implementation of this invention shown 
in Figure 4, a request packet 22 contains 6 bytes of data — 4.5 
address bytes and 1.5 control bytes. Each request packet uses 
all nine bits of the multiplexed data/address lines (AddrValid 23 
+ BusData[0:7] 24) for all six bytes of the request packet. 
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Setting 23 AddrValid • 1 in an otherwise unused even cycle 
indicates the start of an request packet (control information ) . 
In a valid request packet, AddrValid 27 must be 0 in the last 
byte. Asserting this signal in the last byte invalidates the 
5 request packet. This is used for the collision detection and 
arbitration logic (described below). Bytes 25*26 contain the 
first 35 address bits, Address [0: 35] * The last byte contains 
AddrValid 27 (the invalidation switch) and 28, the remaining 
address bits. Address [36* 39], and BlockSise[0*3] (control 
%0 information) . 

% The first byte contains two 4 bit fields containing 

control information, AccessType[0:3] , an op code (operation code) 
™ which, for example, specifies the type of access, and 
f Master[0:3], a position reserved for the master sending the 

115 packet to include its master ID number. Only master numbers 1 
u through 15 are allowed - master number 0 is reserved for special 
m system commands. Any packet with Master[0:3] « 0 is an invalid 
or special packet and is treated accordingly* 

The AccessType field specifies whether the requested 
20 operation is a read or write and the type of access, for example, 
whether it is to the control registers or other parts of the 
device, such as memory. In a preferred implementation, 
AccessTypeJO] is a Read/Write switcht if it is a 1, then the 
operation calls for a read from the slave (the slave to read the 
25 requested memory block and drive the memory contents onto the 
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baa)} if it is a 0, the operation calls for a write into, the 
slave (the slave to read data from the bus and write it to 

memory). AccessType[l«3] provides op to 8 different access types 
for a slave. AccessType[li2] preferably indicates the timing of 
the response, which is stored in an access-time register, 
AccessRegtf. The choice of access-time register can be selected 
directly by having a certain op code select that register, or 
indirectly by having a slave respond to selected op codes with 
pre-selected access times (see table below). The remaining bit, 
AccessType[3] may be used to send additional information about 
the request to the slaves. 

One special type of access is control register access, 
which involves addressing a selected register in a selected 
slave. In the preferred implementation of this invention, 
AccessType[lx3] equal to zero indicates a control register 
request and the address field of the packet indicates the desired 
control register. For example, the most significant two bytes 
can be the device ID number (specifying which slave is being 
addressed) and the least significant three bytes can specify a 
register address and may also represent or include data to be 
loaded into that control register. Control register accesses are 
used to initialize the access-time registers, so it is preferable 
to use a fixed response time which can be preprogrammed or even 
hard wired, for example the value in AccessRegO, preferably 8 
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cycles. Control register access can also be used to initialise 
or modify other registers, including address registers* 

The method of this invention provides for access node 
control specifically for the DRAMs. One such access ©ode 

5 determines whether the access is page mode or normal RAS access. 
In normal mode (in conventional DRAMS and in this invention), the 
DRAM column sense amps or latches have been precharged to a value 
intermediate between logical 0 and 1. This precharging allows 
access to a row in the RAM to begin as soon as the access request 
lf| for either inputs (writes) or outputs (reads) is received and 

: Q allows the column sense amps to sense data quickly* In page mode 

IV (both conventional and in this invention), the DRAM holds the 
data in the column sense amps or latches from the previous read 
or write operation. If a subsequent request to access data is 

£1 directed to the same row, the DRAM does not need to wait for the 

□ data to be sensed (it has been sensed already) and access time 
for this data is much shorter than the normal access time. Page 
mode generally allows much faster access to data but to a smaller 
block of data (equal to the number of sense amps). However, if 

20 the requested data is not in the selected row, the access time is 
longer than the normal access time, since the request must wait 
for the RAM to precharge before the normal mode access can start. 
Two access-time registers in each DRAM preferably contain the 
access times to be used for normal and for page-mode accesses, 

25 respectively. 
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The access mode also determines whether the DRAM should 
precharge the sense amplifiers or should save the contents of the 
sense amps for a subsequent page mode access. Typical settings 
are "precharge after normal access* and "save after page mode 
access" but "precharge after page mode access" or "save after 
normal access" are allowed, selectable modes of operation. The 
DRAM can also be set to precharge the sense amps if they are not 
accessed for a selected period of time* 

In page mode, the data stored in the DRAM sense 
amplifiers may be accessed within much less time than it takes to 
read out data in normal mode (~10-20 nS vs. 40-100 nS). This 
data may be kept available for long periods. However, if these 
sense amps (and hence bit lines) are not precharged after an 
access, a subsequent access to a different memory word (row) will 
suffer a precharge time penalty of about 40-100 nS because the 
sense amps must precharge before latching in a new value. 

The contents of the sense amps thus may be held and 
used as a cache, allowing faster, repetitive access to small 
blocks of data. DRAM-based page-mode caches have been attempted 
in the prior art using conventional DRAM organisations but they 
are not very effective because several chips are required per 
computer word. Such a conventional page-mode cache contains many 
bits (for example, 32 chips x 4Kbits) but has very few 
independent storage entries. In other words, at any given point 
in time the sense amps hold only a few different blocks or memory 
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•locales- (a single block of 4K words, in the example above). 
Simulations have shown that upwards of 100 blocks are required to 
achieve high hit rates (>90% of requests find the requested data 
already in cache memory) regardless of the sire of each block. 
See, for example, Anant Agarwal, et. al. # -An Analytic Cache 
Model," ACM Transactions on Computer Systems, Vol. 7(2), pp. 
184-215 (May 1989). 

The organization of memory in the present invention 
allows each DRAM to hold one or more (4 for 4MBit DRAMS) 
separately- addressed and independent blocks of data. A personal 
computer or workstation with 100 such DRAMs (i.e. 400 blocks or 
locales) can achieve extremely high, very repeatable hit rates 
(98-99% on average) as compared to the lower (50-80%), widely 
varying hit rates using DRAMS organized in the conventional 
fashion. Further, because of the time penalty associated with 
the deferred precharge on a -miss- of the page-mode cache, the 
conventional DRAM-based page-mode cache generally has been found 
to work less well than no cache at all. 


t 

4 
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For DRAM slave access, the access types are preferably 


txsed in the following way: } 
Acces sType f 1 1 3 1 Use ' Acces sTlae 

0 Control Register Fixed, 8[AccessRegO] 


1 Unused Fixed, 8[AccessRegO] 

2*3 7 Unused Acces sRegl 

4*5 Page Mode DRAM AccessReg2 


6-7 ' Normal DRAM access AccessReg3 

Persons skilled in the art will recognise that a series of 
available bits could be designated as switches for controlling 
these access modes. For example: 

AccessType[ 2] * page mode/normal switch 
AccessType[3] * precharge/save-data switch 

BlockSize[0:3] specifies the size of the data block 
transfer. If BlockSize[0] is 0, the remaining bits are the 
binary representation of the block size (0-7). If BlockSize[0] 
is 1, then the remaining bits give the block size as a binary 
power of 2, from 8 to 1024. A zero-length block can be 
interpreted as a special command, for example, to refresh a DRAM 
without returning any data, or to change the DRAM from page mode 
to normal access mode or vice-versa. 
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BlockSizer 0.2*1 


Number Stl PYfres in Block 


0-7 

8 

9 


0-7 respectively 
8 

16 

32 

€4 

128 

256 

512 

1024 


10 
11 
12 
13 
14 
15 


Persons skilled in the art will recognize that other block size 
encoding schemes or values can be used* 


access time by reading or writing data from or to the bus over 
bus lines BusData[0:7] and AddrValid will be at logical 0. In a 
preferred embodiment , substantially each memory access will 
involve only a single memory device, that is, a single block will 
be read from or written to a single memory device. 

Retry Format 

In some cases, a slave may not be able to respond 
correctly to a request, e.g., for a read or write. In such a 
situation, the slave should return an error message, sometimes 
called a K(o)ACK(nowledge) or retry message. The retry message 
can include information about the condition requiring a retry, 
but this increases system requirements for circuitry In both 
slave and masters. A simple message indicating only that an 
error has occurred allows for a less complex slave, and the 
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In most cases, a slave will respond at the selected 


■aster can take whatever action is needed to understand and 

* 

correct the cause of the error. 

Por example, under certain conditions a slave might not 
be able to supply the requested data. During a page-mode access, 
the DRAM selected must be in page mode and the requested address 
must match the address of the data held in the sense amps or 
latches. Each DRAM can check for this match during a page-mode 
access. If no match is found, the DRAM begins precharging and 
returns a retry message to the master during the first cycle of 
the data block (the rest of the returned block is ignored). The 
master then must wait for the precharge time (which is set to 
accommodate the type of slave in question, stored in a special 
register, PreChargeReg ) , and then resend the request as a normal 
DRAM access (AccessType « 6 or 7). 

In the preferred form of the present invention, a slave 
signals a retry by driving AddrValid true at the time the slave 
was supposed to begin reading or writing data. A master which 
expected to write to that slave must monitor AddrValid during the 
write and take corrective action if it detects a retry message. 
Figure 5 illustrates the format of a retry message 28 which is 
useful for read requests, consisting of 23 AddrValid-1 with 
Master[0:3] « 0 in the first (even) cycle. Note that AddrValid 
is normally 0 for data block transfers and that there is no 
master 0 (only 1 through 15 are allowed). All DRAMs and masters 
can easily recognize such a packet as an invalid request packet, 
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and therefore a retry message. In this type of bus transaction 
all of the fields except for Master[0s3] and AddrValid 23 nay be 
used as information fields, although in the implementation 
described, the contents are undefined. Persons skilled in the 
art recognize that another method of signifying a retry message 
is to add a Datalnvalid line and signal to the bus. This signal 
could be asserted in the case of a HACK. 

Bus Arbitration 

In the case of a single master, there are by definition 
no arbitration problems. The master sends request packets and 
keeps track of periods when the bus will be busy in response to 
that packet. The master can schedule multiple requests so that 
the corresponding data block transfers do not overlap. 

The bus architecture of this invention is also useful 
in configurations with multiple masters. When two or more 
masters are on the same bus, each master must keep track of all 
the pending transactions, so each master knows when it can send a 
request packet and access the corresponding data block transfer* 
Situations will arise, however, where two or more masters send a 

m 

request packet at about the same time and the multiple requests 
must be detected, then sorted out by some sort of bus 
arbitration . 

There are many ways for each master to keep track of 
when the bus is and will be busy. A simple method is for each 
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master to maintain a bus -busy data structure, for exajnple by 
maintaining two pointers, one to indicate the earliest point in 
the future when the bus will be busy and the other to indicate 
the earliest point in the future when the bus will be free, that 
is, the end of the latest pending data block transfer. Using 
this information, each master can determine whether and when 
there is enough time to send a request packet (as described above 
under Protocol) before the bus becomes busy with another data 
block transfer and whether the corresponding data block transfer 
will interfere with pending bus transactions. Thus each master 
must read every request packet and update its bus-busy data 
structure to maintain information about when the bus is and will 
be free. 

With two or more masters on the bus, masters will 
occasionally transmit independent request packets during the same 
bus cycle. Those multiple requests will collide as each such 
master drives the bus simultaneously with different information, 
resulting in scrambled request information and neither desired 
data block transfer ♦ In a preferred form of the invention, each 
device on the bus seeking to write a logical 1 on a BusData or 
AddrValid line drives that line with a current sufficient to 
sustain a voltage greater than or equal to the high-logic value 
for the system. Devices do not drive lines that should have a 
logical 0; those lines are simply held at a voltage corresponding 
to a' low-logic value. Each master tests the voltage on at least 
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soma, preferably all, bus data and the AddrValid lines so the 
master can detect a logical '1' where the expected level is '0' 
on a line that it does not drive during a given bus cycle but 
another master does drive. 

Another way to detect collisions is to select one or 
more bus lines for collision signalling. Each master sending a 
request drives that line or lines and monitors the selected lines 
for more than the normal drive current (or a logical value of 
•>1"), indicating requests by more than one master. Persons 
skilled in the art will recognize that this can be implemented 
i with a protocol involving BusData and AddrValid lines or could be 
implemented using an additional bus line. 

In the preferred form of this invention, each master 
detects collisions by monitoring lines which it does not drive to 
see if another master is driving those lines. Referring to Fig. 
4, the first byte of the request packet includes the number of 
each master attempting to use the bus (Master[0:3] ) . If two 
masters send packet requests starting at the same point in time, 
the master numbers will be logical "or"ed together by at least 
those masters, and thus one or both of the masters, by monitoring 
the data on the bus and comparing what it sent, can detect a 
collision. Por instance if requests by masters number 2 (0010) 
and 5 (0101) collide, the bus will be driven with the value 
Kaster[0»3]«7 (0010 + 0101 » 0111). Master number 5 will detect 
that the signal Haster[2] » 1 and master 2 will detect that 
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Masterll] and Master[3] « 1, telling both masters that a 
collision has occurred. Another example is masters 2 and 11, for 
which the bus will be driven with the value Kasterf Oi 3]«11 (0010 
+ 1011 « 1011), and although master 11 can't readily detect this 
collision, master 2 can. When any collision is detected, each 
master detecting a collision drives the value of AddrValid 27 in 
byte 5 of the request packet 22 to 1, which is detected by all 
masters, including master 11 in the second example above, and 
forces a bus arbitration cycle, described below. 

Another collision condition may arise where master A 
sends a request packet in cycle 0 and master B tries to send a 
request packet starting in cycle 2 of the first request packet, 
thereby overlapping the first request packet. This will occur 
from time to time because the bus operates at high speeds, thus 
the logic in a 6econd-initiating master may not be fast enough to 
detect a request initiated by a first master in cycle 0 and to 
react fast enough by delaying its own request. Master B 
eventually notices that it wasn't supposed to try to send a 
request packet (and consequently almost surely destroyed the 
address that master A was trying to send), and, as in the example 
above of a simultaneous collision, drives a 1 on AddrValid during 
byte 5 of the first request packet 27 forcing an arbitration. 
The logic in the preferred implementation is fast enough that a 
master should detect a request packet by another master by cycle 

* 
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3 of the first request packet, so no master is likely to attempt 
to send a potentially colliding request packet later than 
cycle 2. 

Slave devices not need to detect a collision directly, 
5 but they must wait to do anything irrecoverable until the last 
byte (byte 5) is read to ensure that the packet is valid. A 
request packet with Master[0:3] equal to 0 (a retry signal) is 
ignored and does not cause a collision. The subsequent bytes of 
such a packet are ignored. 
10 To begin arbitration after a collision, the masters 

g wait a preselected number of cycles after the aborted request 
M packet (4 cycles in a preferred implementation), then use the 
yj next free cycle to arbitrate for the bus (the next available even 
U cycle in the preferred implementation). Each colliding master 

I.JL. 

15 jpy signals to all other colliding masters that it seeks to send a 
J request packet, a priority is assigned to each of the colliding 
masters, then each master is allowed to make its request in the 
order of that priority. 

Figure 6 illustrates one preferred way of implementing 

20 this arbitration. Each colliding master signals its intent to 
send a request packet by driving a single Bus Data line during a 
single bus cycle corresponding to its assigned master number (1- 
15 in the present example). During two-byte arbitration cycle 
29, byte 0 is allocated to requests 1-7 from masters 1-7., 

25 respectively, (bit 0 is not used) and byte 1 is allocated to 

* 
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requests 8-15 from masters 8-15, respectively. At least one 

* 

device and preferably each colliding master reads the values on 
the bos during the arbitration cycles to determine and store 
which masters desire to use the bus. Persons skilled in the art 
will recognize that a single byte can be allocated for 
arbitration requests if the system includes more bus lines than 
masters. Kore than 15 masters can be accommodated by using 
additional bus cycles. 

A fixed priority scheme (preferably using the master 
numbers, selecting lowest numbers first) is then used to 
prioritize, then sequence the requests in a bus arbitration queue 
which is maintained by at least one device. These requests are 
queued by each master in the bus-busy data structure and no 
further requests are allowed until the bus arbitration queue is 
cleared. Persons skilled in the art will recognize that other 
priority schemes can be used, including assigning, priority 
according to the physical location of each master. 

System Configuration/Reset 

In the bus-based system of this invention, a mechanism 
is provided to give each device on the bus a unique device 
identifier (device ZD) after power-up or under other conditions 
as desired or needed by the system. A master can then use this 
device ZD to access a specific device, particularly to set or 
modify registers of the specified device, including the control 
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and address registers. In the preferred embodiment, one master 
is assigned to carry out the entire system configuration process. 
The master provides a series of unique device ID numbers for each 
unique device connected to the bus system. In the preferred 
embodiment, each device connected to the bus contains a special 
device-type register which specifies the type of device, for 
instance CPU, 4 MBit memory, 64 MBit memory or disk controller. 
The configuration master should check each device, determine the 
device type and set appropriate control registers, including 
access-time registers. The configuration master should check 
each memory device and set all appropriate memory address 
registers . 

One means to set up unique device ID numbers is to have 
each device to select a device ID in sequence and store the value 
in an internal device ID register. For example, a master can 
pass sequential device ID numbers through shift registers in each 
of a series of devices, or pass a token from device to device 
whereby the device with the token reads in device ID information 
from another line or lines. In a preferred embodiment, device ID 
numbers are assigned to devices according to their physical 
relationship, for instance, their order along the bus. 

In a preferred embodiment of this invention, the device 
ID setting is accomplished using a pair of pins on each device, 
Hesetln and ResetOut. These pins handle normal logic signals and 
are used only during device ID configuration. On each rising 
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edge of the clock, each device copies Reset In (an Input) into a 
four-stage reset shift register. The output of the reset shift 
register is connected to ResetOut, which in turn connects to 
Reset In for the next sequentially connected device. 
Substantially all devices on the bus are thereby daisy-chained 
together. A first reset signal, for example, while Resetln at a 
device is a logical 1, or when a selected bit of the reset shift 
register goes from zero to non-zero, causes the device to hard 
reset, for example by clearing all internal registers and 
resetting all state machines. A second reset signal, for 
example, the falling edge of Resetln combined with changeable 
values on the external bus, causes that device to latch the 
contents of the external bus into the internal device ID register 
(Device[0:7J). 

To reset all devices on a bus, a master sets the 
Resetln line of the first device to a "1" for long enough to 
ensure that all devices on the bus have been reset (4 cycles 
times the number of devices — note that the maximum number of 
devices on the preferred bus configuration is 256 (8 bits), so 
that 1024 cycles is always enough time to reset all devices.) 
Then Resetln is dropped to "0* and the BusData lines are driven 
with the first followed by successive device ID numbers, changing 
after every 4 clock pulses. Successive devices set those device 
ID numbers into the corresponding device ID register as the 
falling edge of Resetln propagates through the shift registers of 
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the daisy-chained devices. Figure 14 shows Resetln at a first 
device going low while a toaster drives a first device ZD onto the 
bus data lines BusData[0i3] ♦ The first device then latches in 
that first device ID. After four clock cycles, the master 
changes BusData[0*3] to the next device ID number and ResetOut at 
the first device goes low, which pulls Resetln for the next 
daisy-chained device low, allowing the next device to latch in 
the next device ID number from BusData[0s3] . In the preferred 
embodiment, one master is assigned device ID 0 and it is the 
responsibility of that master to control the Resetln line and to 
drive successive device ID numbers onto the bus at the 
appropriate times. In the preferred embodiment, each device 
waits two clock cycles after Resetln goes low before latching in 
a device ID number from BusData[0:3] . 

Persons skilled in the art recognize that longer device 
ID numbers could be distributed to devices by having each device 
read in multiple bytes from the bus and latch the values into the 
device ID register. Persons skilled in the art also recognise 
that there are alternative ways of getting device ID numbers to 
unique devices. For instance, a series of sequential numbers 
could be clocked along the Resetln line and at a certain time 
each device could be instructed to latch the current reset shift 
register value into the device ID register. 

The configuration master should choose and set an 
access time in each access-time register in each slave to a 
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period sufficiently long to allow the slave to perform an actual, 
desired memory access. For example, for a normal DRAM access, 
this time must be longer than the row address strobe (RAS) access 
time. If this condition is not met, the slave may not deliver 
the correct data. The value stored in a slave access-time 
register is preferably one-half the number of bus cycles for 
which the slave device should wait before using the bus in 
response to a request. Thus an access time value of '1' would 
indicate that the slave should not access the bus until at least 
two cycles after the last byte of the request packet has been 
received. The value of AccessRegO is preferably fixed at 8 
(cycles) to facilitate access to control registers. 

The bus architecture of this invention can include more 
than one master device. The reset or initialization sequence 
should also include a determination of whether there are multiple 
masters on the bus, and if so to assign unique master ID numbers 
to each. Persons skilled in the art will recognize that there 

m 

are many ways of doing this. For instance, the master could poll 
each device to determine what type of device it is, for example, 
by reading a special register then, for each master device, write 
the next available master ID number into a special register. 

ECC 

Error detection and correction ("ECC") methods well 
known in the art can be implemented in this system. ECC 
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information typically is calculated for a block of data at the 
tine that block of data is first written into memory* The data 
block usually has an integral binary size, e.g. 256 bits, and the 
ECC information uses significantly fewer bits. A potential 
5 problem arises in that each binary data block in prior art 

schemes typically is stored with the ECC bits appended, resulting 
in a block size that is not an integral binary power* 

In a preferred embodiment of this invention, ECC 
P information is stored separately from the corresponding data, 
10 I which can then be stored in blocks having integral binary size. 
S ECC information and corresponding data can be stored, for 
~ example, in separate DRAM devices. Data can be read without ECC 

using a single request packet, but to write or read error- 
r : corrected data requires two request packets, one for the data and 
IffU a second for the corresponding ECC information. ECC information 
%Q may not always be stored permanently and in some situations the 
ECC information may be available without sending a request packet 
or without a bus data block transfer. 

In a preferred embodiment, a standard data block size 
20 can be selected for use with ECC, and the ECC method will 
determine the required number of bits of information in a 
corresponding ECC block. RAMs containing ECC information can be 
programmed to store an access time that is equal tot (1) the 
access time of the normal RAM (containing data) plus the time to 
25 access a standard data block (for corrected data) minus the time 
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to send a request packet (€ bytes); or' (2) the access time of a 
normal RAM minus the time to access a standard BCC block minus 
the time to send a request packet* To read a data block and the 
corresponding ECC block, the master simply Issues a request for 
the data Immediately followed by a request for the ECC block. 
The ECC RAM will wait for the selected access time then drive Its 
data onto the bus right after (In case (1) above)) the data RAM 
has finished driving out the data block. Persons skilled In the 
art will recognise that the access time described In case (2) 
above can be used to drive ECC data before the data Is driven 
onto the bus lines and will recognise that writing data can be 
done by analogy with the method described for a read. Persons 
skilled in the art will also recognize the adjustments that must 
be made in the bus-busy structure and the request packet 
arbitration methods of this invention in order to accommodate 
these paired ECC requests. 

Since this system is quite flexible, the system 
designer cam choose the size of the data blocks and the number of 
ECC bits using the memory devices of this invention. Note that 
the data stream on the bus can be interpreted in various ways. 
Por instance the sequence can be 2* data bytes followed by 2 m ECC 
bytes (or vice versa), or the sequence can be 2 k iterations of 8 
data bytes plus 1 ECC byte. Other information, such as 
information used by a directory-based cache coherence scheme, can 
also -be managed this way. See, for example, Anant Agarwal, et 
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al., "Sc&leabl. Directory Schemes for Cache Consistency,* 15th 
IntmmMtlonal SympoBlum on Computer Architecture, SUM 2J88, pp. 
980-289* Those skilled in the art will recognise alternative 
SMthods of implementing BCC schemes that are within the teachings 
of this invention. 

Low Power 3-D Packaging 

Another major advantage of this invention is that it 
drastically reduces the memory system power consumption. Nearly 
all the power consumed by a prior art DRAM is dissipated in 
performing row access. By using a single row access in a single 
RAM to supply all the bits for a block request (compared to a 
row-access in each of multiple RAMs in conventional memory 
systems) the power per bit can be made very small. Since the 
power dissipated by memory devices using this invention is 
significantly reduced , the devices potentially can be placed much 
closer together than with conventional designs. 

The bus architecture of this invention makes possible 
an innovative 3-D packaging technology. By using a narrow, 
srultiplexed (time-shared) bus, the pin count for an arbitrarily 
large memory device can be kept quite small - on the order of 20 
pins. Moreover, this pin count can be kept constant from one 
generation of DRAM density to the next. The low power 
dissipation allows each package to be smaller, with narrower pin 
pitches (spacing between the IC pins). With current surface 
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aount technology supporting pin pitches as low a* 20 mils, all 

" * > » # • 

of f-derice connactions can be iaqpleaented on a singla sdga of the 
Msory device. Semiconductor die useful in this invention 
preferably have connections or pads along one edge of the die 
which can then be wired or otherwise connected to the package 
pins with wires having similar lengths. This geometry also 
allows for very short leads, preferably with an effective lead 
length of less than 4 mm. Furthermore, this invention uses only 
bused interconnections, i.e., each pad on each device is 
connected by the bus to the corresponding pad of each other 
device • 

The use of a low pin count and an edge-connected bus 
permits a simple 3-D package, whereby the devices are stacked and 
the bus is connected along a single edge of the stack. The fact 
that all of the signals are bused is important for the 
implementation of a simple 3-D structure. Without this, the 
complexity of the "backplane" would be too difficult to make cost 
effectively with current technology* The individual devices in a 
stack of the present Invention can be packed quite tightly 
because of the low power dissipated by the entire memory system, 
permitting the devices to be stacked bumper- to-bumper or top to 
bottom. Conventional plastic-injection molded small outline (80) 
packages can be used with a pitch of about 2.5 nm (100 mils), but 
the ultimate limit would be the device die thickness, which is 

*> J* * * 
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•boat an order of magnitude smaller, 0.2-0.5 am using current 


wafer technology 
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Bos Electrical Description 

By using devices with very low power dissipation and 
close physical packing, the bus can be made quite short, which in 
turn allows for short propagation times and high data rates . The 
bus of a preferred embodiment of the present invention consists 
of a set of resistor-terminated controlled impedance transmission 
lines which can operate up to a data rate of 500 MHs (2 ns 
cycles). The characteristics of the transmission lines are 
strongly affected by the loading caused by the DRAMs (or other 
•laves) mounted on the bus. These devices add lumped capacitance 
to the lines which both lowers the impedance of the lines and 
decreases the transmission speed. In the loaded environment, the 
bus impedance is likely to be on the order of 25 ohms and the 
propagation velocity about c/4 (c - the speed of light) or 7.5 
cm/ns. To operate at a 2 ns data rate, the transit time on the 
bus should preferably be kept under 1 ns, to leave 1 ns for the 
setup and hold time of the input receivers (described below) plus 
clock skew. Thus the bus lines must be kept quite short, under 
•bout 8 cm for maximum performance. Lower performance systems 
may have much longer lines, e.g. a 4 ns bus may have 24 cm lines 
(3 ns transit time, 1 ns setup and hold time). ~ — 
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In the preferred embodiment, the but uses current 
source drivers. Each output suet be able to sink SO aft* which 
provides en output swing of about 500 aV or nore. In the 
preferred embodiment of this invention, the bus is active low. 
The unasserted state (the high value) is preferably considered a 
logical zero, and the asserted value (low state) is therefore a 
logical 1. Those skilled in the art understand that the method 
of this invention can also be implemented using the opposite 
logical relation to voltage. The value of the unasserted state 
is set by the voltage on the termination resistors, and shou ld be 
high enough to allow the outputs to act as current sources, while 
being as low as possible to reduce power dissipation. These 
constraints may yield a termination voltage about 2V above jground 
in the preferred implementation. Current source drivers cause 
the output voltage to be proportional to the sum of the sources 

driving the bus. 

Referring to Fig. 7, although there is no stable 
condition where two devices drive the bus at the same time, 
conditions can arise because of propagation delay on the wires 
where one device, A 41, can start driving its part of the bus 44 
while the bus is still being driven by another device, B 42 
(already asserting a logical 1 on the bus). In a system using 
current drivers, when B 42 is driving the bus (before time 46), 
the value at points 44 and 45 is logical 1. If B 42 switches off 
at time 46 just when A 41 switches on, the additional drive by 
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device A 41 causes the voltage at the output 44 of A 41 to drop 
briefly below the normal value. The voltage returns to its 
normal value at time 47 when the effect of device B 42 turning 
off is felt. The voltage at point 45 goes to logical 0 when 
device B 42 turns off, then drops at time 47 when the effect of 
device A 41 turning on is felt. Since the logical 1 driven by 
current from device A 41 is propagated irrespective of the 
previous value on the bus, the value on the bus is guaranteed to 
settle after one time of flight (U) delay, that is, the time it 
takes a signal to propagate from one end of the bus to the other. 
If a voltage drive was used (as in ECL wired-ORing ) , a logical 1 
on the bus (from device B 42 being previously driven) would 
prevent the transition put out by device A 41 being felt at the 
most remote part of the system, e.g., device 43, until the 
turnof f waveform from device B 42 reached device A 41 plus one 
time of flight delay, giving a worst case settling time of twice 
the time of flight delay. 

Clocking 

Clocking a high speed bus accurately without 
introducing error due to propagation delays can be implemented by 
having each device monitor two bus clock signals and then derive 
internally a device clock, the true system clock. The bus clock 
information can be sent on one or. 'two lines to provide a 
mechanism for each bused device to generate an internal device 
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clock with ttro skew relative to all the other devioe clocks. 
jf^fcJUif erring to Figure 8, in the preferred implementation, * bos 
clock generator 50 at one end of the bus propagates en early bus 
clock signal in one direction along the bus, for example on line 
5 53 from left to right, to the far end of the bus* The same clock 
signal then is passed through the direct connection shown to a 
second line 54, and returns as a late bus clock signal along the 
bus from the far end to the origin, propagating from right to 
left. A single bus clock line can be used if it is left 
10P exterminated at the far end of the bus, allowing the early bus 
sS clock signal to reflect back along the same line as a late bus 
u clock signal. 

Figure 8b illustrates how each device 51, 52 receives 
L each of the two bus clock signals at a different time (because of 
Xjfi propagation delay along the wires ) , with constant midpoint in 
tine between the two bus clocks along the bus. At each device 
51, 52, the rising edge 55 of Clockl 53 is followed by the rising 
edge 56 of Clock2 54. Similarly, the falling edge 57 of Clockl 
53 is followed by the falling edge 58 of Clock2 54. Shis 
20 waveform relationship is observed at all other devices along the 
bus. Devices which are closer to the clock generator have a 
greater separation between Clockl and Ciock2 relative to devices 
farther from the generator because of the longer time required 

« 

for each clock pulse to traverse the bus and return along line 
25 54, but the midpoint in time 59, €0 between corresponding rising 
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or falling edges is fixed because, for any given derloe, the 
length of each clock line between the far end of the bos end that 
device Is equal. Bach device must sample the two bos Clocks and 

■ - £ 

generate its own internal device clock at the midpoint of the 
two. 

Clock distribution problems can be further reduced by 
using a bus clock and device clock rate equal to the bus cycle 
data rate divided by two, that is, the bus clock period is twice 
the bus cycle period. Thus a 500 KHz bus preferably uses a 250 
KHz clock rate. This reduction in frequency provides two 
benefits. First it makes all signals on the bus have the same 
worst case data rates — data on a 500 MHz bus can only change 
every 2 ns. Second, clocking at half the bus cycle data rate 
makes the labeling of the odd and even bus cycles trivial, for 
example, by defining even cycles to be those when the internal 
device clock is 0 and odd cycles when the internal device clock 
is 1. 

Multiple Buses 

The limitation on bus length described above restricts 
the total number of devices that can be placed on a single bus. 
Using 2.5 mm spacing between devices, a single 6 cm bus will hold 
about 32 devices. Persons skilled in the art will recognise 
certain applications of the present invention wherein the overall 
data rate on the bus is adequate but memory or processing 
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requirements necessitate a such larger number of device* (many 
»ore than 32). Larger systems can easily be built using the 
^teachings of this invention by using one or sore memory 
subsystems, designated primary bus units, each of which consists 
of two or sore devices, typically 32 or close to the maximum 
allowed by bos design requirements, connected to a transceiver 
device. 

Referring to Figure 9, each primary bus unit can be 
mounted on a single circuit board 66, sometimes called a memory 
stick. Each transceiver device 19 in turn connects to a 
transceiver bus 65, similar or identical in electrical and other 
respects to the primary bus 18 described at length above. In a 
preferred implementation, all masters are situated on the 
transceiver bus so there are no transceiver delays between 
masters and all memory devices are on primary bus units so that 
all memory accesses experience an equivalent transceiver delay, 
but persons skilled in the art will recognise how to implement 
systems which have masters on more than one bus unit and memory 
devices on the transceiver bus as well as on primary bus units. 
In general, each teaching of this invention which refers to a 
memory device can be practiced using a transceiver device and one 
or more memory devices on an attached primay bus unit. Other 
devices, generically referred to as peripheral devices, including 
disk controllers, video controllers or I/O devices can also be 
attached to either the transceiver bus or a primary bus unit, as 
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desired. Persons skilled in the art will rscogniss haw to use a 
•ingle primary bos unit or multiple primary bus units at needed 

L. 

with a transceiver bus in certain system designs. 

r" 

The transceivers are quite simple in function. They 
detect request packets on the transceiver bus and transmit them 
to their primary bus unit. If the request packet calls for a 
write to a device on a transceiver's primary bus unit, that 
transceiver keeps track of the access time and block site and 
forwards all data from the transceiver bus to the primary bus 
I unit during that time. The transceivers also watch their primary 
bus unit, forwarding any data that occurs there to the 
transceiver bus. The high speed of the buses means that the 
transceivers will need to be pipelined, and will require an 
additional one or two cycle delay for data to pass through the 
transceiver in either direction. Access times stored in masters 
on the transceiver bus must be increased to account for 
transceiver delay but access times stored in slaves on a primary 
bus unit should not be modified. 

Persons s lei lied in the art will recognise that a more 
sophisticated transceiver can control transmissions to and from 

primary bus units. An additional control line, TracvrRW can be 

* 

bused to all devices on the transceiver bus, using that line in 
conjunction with the AddrValid line to indicate to all devices on 
the transceiver bus that the information on the data lines is* 1) 
a request packet, 2) valid data to a slave, 3) valid data from a 
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slave, or 4) invalid data (ox idle bat). Using this extra 
control line obviates the need for the transceivers to koap track 
of whan data needs to ba forwarded from its primary bus to the 
transceiver bus - all transceivers send all data from their 
primary bus to the transceiver bus whenever the control signal 
indicates condition 2) above* In a preferred implementation of 
this invention, if AddrValid and TracvrRW are both low, there is 
no bus activity and the transceivers should remain in an idle 
state. A controller sending a request packet will drive 
AddrValid high, indicating to all devices on the transceiver bus 
that a request packet is being sent which each transceiver should 
forward to its primary bus unit. Each controller seeking to 
write to a slave should drive both AddrValid and TracvrRW high, 
indicating valid data for a slave is present on the data lines. 
Each transceiver device will then transmit all data from the 
transceiver bus lines to each primary bus unit. Any controller 
expecting to receive information from a slave should also drive 
the TracvrRW line high, but not drive AddrValid, thereby 
indicating to each transceiver to transmit any data coming from 
any slave on its primary local bus to the transceiver bus. A 
still sore sophisticated transceiver would recognise signals 
addressed to or coming from its primary bus unit and transmit 
signals only at requested times* 

An example of the physical mounting" of the transceivers 
is shown in Figure 9. One important feature of this physical 
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arrangement is to Integrate the bus of each transceiver 19 with 
the original bos of DRAMs or other devices 15, 16, 17 on the 
primary bus unit €6* The transceivers 19 have pins on two sides, 
end are preferably mounted flat on the primary bus unit with a 
first set of pins connected to primary bus 18. A second set of 
transceiver pins 20, preferably orthogonal to the first set of 
pins, are oriented to allow the transceiver 19 to be attached to 
the transceiver bus 65 in much the same way as the DRAMs were 
attached to the primary bus unit. The transceiver bus can be 
generally planar and in a different plane, preferably orthogonal 
to the plane of each primary bus unit. The transceiver bus can 
also be generally circular with primary bus units mounted 
perpendicular and tangential to the transceiver bus. 

Using this two level scheme allows one to easily build 
a system that contains over 500 slaves (16 buses of 32 DRAMs 
each). Persons skilled in the art can modify the device ID 
scheme described above to accommodate more than 256 devices, for 
example by using a longer device ID or by using additional 
registers to hold some of the device ID. This scheme can be 
extended in yet a third dimension to make a second-order 
transceiver bus, connecting multiple transceiver buses by 
aligning transceiver bus units parallel to and on top of each 
other and busing corresponding signal lines through a suitable 
transceiver. Using such a second-order transceiver bus, one 
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could connect many thousands of slave devices into what Is 

*. 

^effectively a single bus. 

> 

pe^fo? Interface 

The device interface to the high-speed bus can be 
divided into three main parts. The first part is the electrical 
interface. This part includes the input receivers, bus drivers 
and clock generation circuitry. The second part contains the 
address comparison circuitry and timing registers. This part 
takes the input request packet and determines if the request is 
for this device, and if it is, starts the internal access and 
delivers the data to the pins at the correct time. The final 
part, specifically for memory devices such as DRAMs, is the DRAM 
column access path. This part needs to provide bandwidth into 
and out of the DRAM sense amps greater than the bandwidth 
provided by conventional DRAHs. The implementation of the 
electrical interface and DRAM column access path are described in 
more detail in the following sections. Persons skilled in the 
art recognise how to modify prior-art address comparison 
circuitry and prior-art register circuitry in order to practice 
the present Invention. 

Electrical Interface - Input/Output Circuitry 

A block diagram of the preferred input/output .circuit 
for address /data/control lines is shown in Figure 10. This 
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circuitry is particularly well-suited for use in DRAM devices bat 
it eta be used or modified by one skilled in the art for vsm in 
other devices connected to the bus of this invention. *Tt 
consists of a set of input receivers 71, 72 and output driver 76 
connected to input/output line 69 and pad 75 and circuitry to use 
the internal clock 73 and internal clock complement 74 to drive 
the input interface. The clocked input receivers take advantage 
of the synchronous nature of the bus. To further reduce the 
performance requirements for device input receivers, each device 
pin, and thus each bus line, is connected to two clocked 
receivers, one to sample the even cycle inputs, the other to 
sample the odd cycle inputs. By thus de-multiplexing the input 
70 at the pin, each clocked amplifier is given a full 2 ns cycle 
to amplify the bus low-voltage-swing signal into a full value 
CMOS logic signal. Persons skilled in the art will recognise 
that additional clocked input receivers can be used within the 
teachings of this invention. For example, four input receivers 
could be connected to each device pin and clocked by a modified 
internal device clock to transfer sequential bits from the bus to 
internal device circuits, allowing still higher external bus 
speeds or still longer settling times to amplify the bus low- 
voltage-swing signal into a full value CMOS logic signal* 

The output drivers are quite simple, and consist of a 
single HHOS pulldown transistor 76. This transistor is sired so 
that under worst case conditions it can still sink the 50 sA 
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required by the bus. For 0.8 micron CMOS technology* the 
transistor vill need to be about 200 microns long. Overall bus 
performance can be improved by using feedback techniques to 
control output transistor current so that the current through the 
device is roughly 50 mA under all operating conditions, although 
this is not absolutely necessary for proper bus operation. An 
example of one of many methods known to persons skilled in the 
art for using feedback techniques to control current is described 
in Bans Schumacher, et al., "CMOS Subnanosecond True-ECL Output 
Buffer, • J. Solid State Circuits, Vol. 25 (1), pp. 150-154 (Feb. 
1990). Controlling this current improves performance and reduces 
power dissipation. This output driver which can be operated at 
500 MHz, can in turn be controlled by a suitable multiplexer with 
two or more (preferably four) inputs connected to other internal 
chip circuitry, all of which can be designed according to well 
known prior art. 

The input receivers of every slave must be able to 
operate during every cycle to determine whether the signal on the 
bus is a valid request packet. This requirement leads to a 
number of constraints on the input circuitry. In addition to 
requiring small acquisition and resolution delays, the circuits 
must take little or no DC power, little AC power and inject very 
little current back into the input or reference lines. The 
standard clocked DRAM sense amp shown in Figure 11 satisfies all 
these requirements except the need for low input currents. When 
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this sense asp goes from sense to sample, the capacitance of the 
internal nodes 83 and 84 in Figure 11 is discharged through the 
reference line €8 and input 69, respectively* This particular 
current is small, but the sum of such currents from all the 
inputs into the reference lines summed over all devices can be 
reasonably large. 

The fact that the sign of the current depends upon on 
the previous received data makes matters worse. One way to solve 
this problem is to divide the sample period into two phases* 
During the first phase, the inputs are shorted to a buffered 
version of the reference level (which may have an offset)* 
During the second phase, the inputs are connected to the true 
inputs* This scheme does not remove the input current 
completely, since the input must still charge nodes 83 and 84 
from the reference value to the current input value, but it does 
reduce the total charge required by about a factor of 10 
(requiring only a 0.25V change rather than a 2.5V change). 
Persons skilled in the art will recognise that many other methods 
can be used to provide a clocked amplifier that will operate on 
very low input currents* 

One important part of the input/output circuitry 
generates an internal device clock based on early and late bus 
clocks* Controlling clock skew (the difference in clock timing 
between devices) is important in a system running with 2 ns 
cycles, thus the internal device clock is generated so the input 
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•ampler and the output driver operate as close in time as 
^possible to midway between the two bus clocks. ;.' 
5 A block diagram of the internal device clock generating 

circuit is shown in Figure 12 and the corresponding timin g 
diagram in Figure 13. The basic idea behind this circuit is 
relatively simple. A DC amplifier 102 is used to convert the 
email-swing bus clock into a full-swing CMOS signal. This signal 
is then fed into a variable delay line 103. The output of delay 
line 103 feeds three additional delay lines t 104 having a fixed 
delay; 105 having the same fixed delay plus a second variable 
delay; and 106 having the same fixed delay plus one half of the 
second variable delay. The outputs 107, 108 of the delay lines 
104 and 105 drive clocked input receivers 101 and 111 connected 
to early and late bus clock inputs 100 and 110, respectively. 
These input receivers 101 and 111 have the same design as the 
receivers described above and shown in Fig. 11. Variable delay 
lines 103 and 105 are adjusted via feedback lines 116, 115 so 
that input receivers 101 and 111 sample the bus clocks just as 
they transition. Delay lines 103 and 105 are adjusted so that 
the falling edge 120 of output 107 precedes the falling edge 121 
of the early bus clock, Clockl 53, by an amount of time 128 equal 
to the delay in input sampler 101. Delay line 108 is adjusted in 
the same way so that falling edge 122 precedes the falling edge 
123 of late bus clock, Clock2 54, by the delay 128 in input 
sampler 111. 
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Since the output* 107 and 108 are synchronised with the 
two bus clocks and the output 73 of the last delay line 106 is 
midway between outputs 107 and 108, that is, output 73 follows 
output 107 by the same amount of time 129 that output 73 precedes 
output 108, output 73 provides an internal device clock midway 
between the bus clocks. The falling edge 124 of internal device 
clock 73 precedes the time of actual input sampling 125 by one 
sampler delay. Kote that this circuit organisation automatically 
balances the delay in substantially all device input receivers 71 
and 72 (Fig. 10), since outputs 107 and 108 are adjusted so the 
bus clocks are sampled by input receivers 101 and 111 just as the 

bus clocks transition. 

In the preferred embodiment, two sets of these delay 
lines are used, one to generate the true value of the internal 
device clock 73, and the other to generate the complement 74 
without adding any inverter delay. The dual circuit allows 
generation of truly complementary clocks, with extremely small 
skew. The complement internal device clock is used to clock the 
'even' input receivers to sample at time 127, while the true 
internal device clock is used to clock the 'odd' input receivers 
to sample at time 125. The true and complement internal device 
clocks are also used to select which data is driven to the output 
drivers. The gate delay between the internal device clock and 
output circuits driving the bus is slightly greater than the 
corresponding delay for the input circuits, which means that the 
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»•* data always will bs driven on the bus slightly altar the old 
data has been sampled. 

DRAM Column Access Modification 

A block diagram of a conventional 4 MBit DRAM 130 is 
shown in Figure 15. The DRAM memory array is divided into a 
number of subarrays 150-157, for example, 8* Bach subarray is 
divided into arrays 148, 149 of memory cells. Row address 
selection is performed by decoders 146. A column decoder 147A, 
147B, including column sense amps on either side of the decoder, 
runs through the core of each subarray. These column sense amps\ 
can be set to precharge or latch the most-recently stored value, | 
as described in detail above. Internal I/O lines connect _each 
set of sense-amps, as gated by corresponding column decoders, to 
input and output circuitry connected ultimately to the device 
pins* These internal I/O lines are used to drive the data from 
the selected bit lines to the data pins (some of pins 131-145), 
or to take the data from the pins and write the selected bit 
lines* Such a column access path organised by prior art 
constraints does not have sufficient bandwidth to interface with 
a high speed bus. The method of this invention does not require 
changing the overall method used for column access, but does 
change implementation details. Many of these details have been 
implemented selectively in certain fast memory devices, .but never 
in conjunction with the bus architecture of this invention. 
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Running the internal I/O lines in the conventional way 

* 

-v-r^at high bus cycle rates is not possible. In the preferred 
r method, several (preferably 4) bytes are read or written during 
each cycle and the column access path is modified to run at a 
5 lower rate (the inverse of the number of bytes accessed per 

cycle, preferably 1/4 of the bus cycle rate). Three different 
techniques are used to provide the additional internal I/O lines 
required and to supply data to memory cells at this rate. First, 
the number of I/O bit lines in each subarray running through the 
l(ft column decoder 147 is increased, for example, to 16, eight for 
v3 each of the two columns of column sense amps and the column 
~t= decoder selects one set of columns from the "top" half 148 of 
%G subarray 150 and one set of columns from the "bottom" half 149 
during each cycle, where the column decoder selects one column 
ijy sense amp per I/O bit line. Second, each column I/O line is 
IS divided into two halves, carrying data independently over 

separate internal I/O lines from the left half 147A and right 
half 147B of each subarray (dividing each subarray into 
quadrants) and the column decoder selects sense amps from each 
20 right and left half of the subarray, doubling the number of bits 
available at each cycle. Thus each column decode selection turns 
on n column sense amps, where n equals four (top left and right, 
bottom left and right quadrants) times the number of I/O lines in 
the bus to each subarray quadrant (8 lines each x 4*32 lines in 
25 the preferred implementation). Finally, during each FAS cycle, 

High Performance Bus Interface -60- 


two different subarrays, e.g. 157 and 153, are accessed* Shis 
doubles again the available number of I/O lines containing data. 
Taken together, these changes increase the internal 2/0 bandwidth 
by at least a factor of 8. Four internal buses are used to route 
these internal I/O lines. Increasing the number of I/O lines and 
then splitting them in the middle greatly reduces the capacitance 
of each internal I/O line which in turn reduces the column access 
time, increasing the column access bandwidth even further. 

The multiple, gated input receivers described above 
allow high speed input from the device pins onto the internal I/O 
lines and ultimately into memory. The multiplexed output driver 
described above is used to keep up with the data flow available 
using these techniques. Control means are provided to select 
whether information at the device pins should be treated as an 
address, and therefore to be decoded, or input or output data to 
be driven onto or read from the internal I/O lines. 

Each subarray can access 32 bits per cycle, 16 bits 
from the left subarray and 16 from the right subarray. With 8 
I/O lines per sense-amplifier column and accessing two subarrays 
at a time, the DRAM can provide 64 bits per cycle. This extra 
I/O bandwidth is not needed for reads (and is probably not used), 
but may be needed for writes. Availability of write bandwidth is 
a more difficult problem than read bandwidth because over-writing 
a value in a aense-amplifier may be a alow operation, depending 
on how the sense amplifier is connected to the bit line. The 
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«xtra set of Internal I/O lines provides some bandwidth margin 
for write operations. . 

Persons skilled in the art will recognise that many 
variations of the teachings of this invention can be practiced 
that still fall within the claims of this invention which follow. 


/ 
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•hat im claimed is* 

1* A memory subsystem coosprislng 

two memory devices connected in parallel to * bus, 
•aid bus including a plurality of bus lines for 
carrying substantially all address, data and control in- 
formation needed by said memory devices, 

said control information including device-select 
information, 

said bus containing substantially fewer bus lines than 
the number of bits in a single address, and 

said bus carrying device-select information without the 
need for separate device-select lines connected directly to 
individual memory devices. 

2. The memory subsystem of claim 1 wherein said bus 
contains at least 8 bus lines adapted to carry at least 16 
address bits and at least 8 data bits. 

3. The memory subsystem of claim 1 wherein said bus also 
includes parallel lines for clock and power. 

4* A system comprising 

a memory subsystem of claim 1 wherein each bus of said 
memory subsystem is connected to its own transceiver device, 
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a transceiver but connecting said transceiver devices, 

, and 

a neans for transferring Information between each of 
said buses of said memory subsystems and said transceiver 
bus, whereby memory subsystems may be integrated into a 
larger system having more memory than an individual memory 
subsystem. 

■ 

5. The system of claim 4 having a plurality of memory 
subsystems . 

6. The system of claim 4 further comprising a master 
device connected to said transceiver bus. 

7. The system of claim 6 wherein said master device is 
selected from the group consisting of a central processing unit, 
a floating point unit and a direct memory access unit* 

8. The system of claim 4 further comprising a peripheral 
device connected to the transceiver bus, said peripheral device 
adapted for connection to other devices not on the bus* 

9. The system of claim 8 wherein said peripheral device is 
selected from the group consisting of an I/O interface port, a 
video controller and a disk controller* 
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10. The system of claim 5 wherein said transceiver bus is 

in a different plane than the plane of the bus of each of said 
memory subsystems. 

11. The system of claim 5 wherein the bus of each memory 
subsystem lies substantially in a subsystem bus plane and said 
transceiver bus lies substantially in a plane orthogonal to said 
subsystem bus plane. 

12. The system of claim 4 having at least two transceiver 
buses, each transceiver bus having a plurality of memory 
subsystem buses connected through a first transceiver to said 

transceiver bus, 

each of said transceiver buses being further connected to a 
second transceiver adapted to interface to a second-order 
transceiver bus, whereby each transceiver bus is connected 
through said second transceiver to form a second-order 
transceiver bus unit. 

13. A semiconductor subsystem bus for interconnecting 
semiconductor devices comprising 

a plurality of semiconductor devices connected in 
parallel to a bus, at least one of said semiconductor 

High Performance Bus Interface -65- 


devices being a memory device or e transceiver jdevioe which 
In turn is connected to a memory subsystem, 
f eaid bus including a plurality of bus lines for 

carrying substantially all address, data and control 
information needed by said semiconductor devices, 

said control information including semiconductor 
device-select information, 

said bus containing substantially fewer bus lines than 
the number of bits in a single address, and 

said bus carrying device-select Information without the 
need for separate device-select lines connected directly to 
individual semiconductor devices, and 

at least one modifiable register in each of the semi- 
conductor devices on said bus, said modifiable registers 
being accessible from said bus, vhereby the subsystem can be 
configured using signals transmitted on said bus. 

14. The semiconductor subsystem bus of claim 13 wherein one 
type of modifiable register is an access-time register designed 
to store a time delay after which a device may take some 
specified action on said bus. 

15. The semiconductor subsystem bus of claim 13 further 
comprising a semiconductor device having at least two access-time 
registers and 
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one of said access-time registers ia permanently programmed 
to contain a fixed valaa and at laaat ona of said access -tiJM 
registers can be modified by information carried on said bus. 


16. The semiconductor subsystem bus of claim 13 further 
comprising a memory device having at least one discrete memory 
section and also having a modifiable address register adapted to 

■ 

store memory address Information which corresponds to each said 
discrete memory section* 


17* The semi conductor subsystem bus of claim 16 wherein 
said memory address information comprises a pointer to said dis- 
crete memory section* 

18. The semiconductor subsystem bus of claim 16 wherein 
said discrete memory section has a top and a bottom and said 
memory address Information comprises pointers to said top and 
said bottom. 


19* The semiconductor subsystem bus of claim 16 wherein 
said memory address information comprises 

a pointer to said discrete memory section and 

a range value indicating the sire of said discrete 

memory section. 
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20. Ths semiconductor subsystem bus of claim 16 vhsrsin 
said address register* of each of said discrate memory eections 
of aach of aald memory davicaa connected to aaid baa are aat to 

contain memory address information that is different for each 
discrete memory section and such that the highest memory address 
in each discrete memory section is one less than the lowest 
memory address in another discrete memory section, 

whereby memory may be organised into one or a small number 
of contiguous memory blocks. 

21. She semiconductor subsystem bus of claim 16 further 
comprising a means for testing each of said discrete memory sec* 
tions of each of said memory devices for proper function, and 

for each non-functional discrete memory section, a 
means for setting at least one address register which 
corresponds to said discrete memory section to indicate that 
said discrete memory section is non-functional, 

for each functional discrete memory section, a means 
for setting at least one address register which corresponds 
to said discrete memory section to contain such 
corresponding address information. 

22* The semiconductor subsystem bus of claim 21 wherein 
said address registers corresponding to said discrete memory 

High Performance Bus Interface -68- 


sections are set to provide one contiguous memory block within 
the subsystem* 

23* The semiconductor subsystem bus of claim 13 wherein one 
of said modifiable registers is a device identification register 
which can be modified to contain a value unique to that 
semiconductor device* 

24* The semiconductor subsystem bus of claim 23 wherein 
said device identification register is set to contain a unique 
value which is a function of the physical position of that 
semiconductor device either along said bus or in relationship to 
other semiconductor devices or said bus. 

25. A bus subsystem comprising 

two semiconductor devices connected in parallel to a 
bus, wherein one of said semiconductor devices is a master 
device, 

said master device including a means for initiating bus 

transactions , 

said bus including a plurality of bus lines for 
carrying substantially all address, data and control 
information needed by said devices, ■ . 

said control information including device-select 
information , 
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Mid bos containing substantially fimr Ham than the 
unbar of bits in a single address , and j^^d~ 
* said bus carrying device-select information without the 

need for separate device-select lines connected directly to 
individual devices on said bus, whereby said master device 
initiates bus transactions which transfer information 
between said semiconductor devices on said bus. 

26* The bus subsystem of claim 25 wherein one of said 
semiconductor devices is a memory device connected to said bus, 
said memory device having at least one discrete memory section 
and also having a modifiable address register adapted to store 
memory address information which corresponds to each said 
discrete memory section* 

27* The bus subsystem of claim 26 wherein one of said 
semiconductor devices comprises a transceiver device connected in 
parallel to said bus and connected in parallel to a memory device 
on a bus other than said bus. 

28* The bus subsystem of claim 26 further including a means 
for said master device to request said memory device to prepare 
for a bus transaction by sending a request packet along said bus, 
said memory device and said master device each having a device* 
internal means to prepare to begin said bus transaction during a 
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device-internal phase and further having a bus access means to 
effect said bus transaction during a bus access phase, said 
request packet including 

a sequence of bytes containing address and control 
5 information , 

said control information including information about 
the requested bus transaction and about the access time/ 
which corresponds to a number of bus cycles, which needs to 
O intervene before beginning said bus-access phase, and 

10 said address information pointing to at least one 

CH memory location within one of said discrete memory sections 

yj of said memory device. 

jU 29. The bus subsystem of claim 28 wherein said memory 

i| device includes a means to read said control information and 
5 initiate said device-internal means at a time so as to complete 

said device-internal phase within said access time and begin said 
bus access phase after said number of bus cycles. 

20 30. The bus subsystem of claim 28 wherein said control 

information comprises an op code. 

31. The bus subsystem of claim 30 wherein said memory 
device includes sense amplifiers adapted to hold a bit of 
25 information or to precharge after a selected time and a means to 
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transfer a data block during a data block transfer either reading 
data from said memory device or writing data into said memory 
device, and 

wherein said op code instructs said memory device to 
activate a response means, said response means including a means 
to 

initiate a data block transfer, 
select the sire of said data block, 

select the time to initiate said data block transfer, 
access a control register, including reading from or 
writing to said control register, 

precharge said sense amplifiers after each of said data 

block transfers is complete, 

hold a bit of information in each of said sense 
amplifiers after each of said data block transfers is 

complete , or 

select normal or page-mode access* 

32 ♦ The bus subsystem of claim 31 wherein said data block 
transfer comprises a read from or a write to memory within a 
single memory device* 

33. 33ie bus subsystem of claim 28 further comprising a 
means for said master device to send control information to a 
specific one of said semiconductor devices on said bus by 
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Including in said request packet a device identification number 
unique to said, semiconductor device. 

34. The bus subsystem of claim 28 further comprising a 
means for said master device to send control information to a 
selected one of said discrete memory portions by including in 
said request packet a specific memory address. 

35. The bus subsystem of claim 28 further comprising a 
means for said master device to send control information to 
substantially all semiconductor devices on said bus by including 

* 

in said request packet a special device identification number 
which is recognized by said semiconductor devices. 

36. The bus subsystem of claim 28 wherein said control 
information specifies directly or indirectly the number of bus 
cycles for said master device and said memory device to wait 
before beginning said bus access phase. 

37. The bus subsystem of claim 36 wherein, for a data block 
transfer, said master device and said memory device use the same 
access time and same data block size regardless of whether said 
data block transfer is a read or write operation. 
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38. The bus subsystem of claim 28 wherein said control 
information further includes a block-size value that encodes and 
specifies the size of the block of data to be transferred. 

39. The bus subsystem of claim 38 wherein said block-size 
value is encoded as a linear value for relatively small block 
sizes values and is encoded as a logarithmic value for relatively 
larger block sizes. 

40. The bus subsystem of claim 38 wherein said block-size 
value is encoded using four bits, and where the encoded value is 


41. The bus subsystem of claim 26 wherein said memory 
device is a DRAM device containing 

a plurality of sense amplifiers, 


Encoded Value 


Block Size (Bvtes ) 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 


0 
1 
2 
3 
4 
5 
6 
7 
8 


10 
11 
12 
13 
14 
15 


16 

32 

64 

128 

256 

512 

1024 
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a means to hold said sense amplifiers in an unmodified 
state after a read or write operation, leaving the device in 
page mode, 

a means to pre charge said sense amplifiers and 

a means for selecting whether to precharge said sense 

amplifiers or to hold said sense amplifiers in an unmodified 

state • 

>* 

42. The bus subsystem of claim 28 wherein said request 
packet comprises an even number of bytes* 

43. The bus subsystem of claim 28 further including a means 
for generating and controlling a plurality of bus cycles, during 
which said bus carries said address, data and control 
information, and wherein alternate said bus cycles are designated 
odd cycles and even cycles, respectively, and wherein said 
request packet begins only on an even cycle. 

44. The bus subsystem of claim 28 further including a means 
for generating ECC information corresponding to a block of data 
and a means for using said ECC information to correct errors in 
storing or reading said block of data, wherein said ECC 
information may be stored separately from said block of data* 
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45. The bus subsystem of claim 44 further comprising at 
least two of said memory devices wherein said ECC information and 
said corresponding block of data are stored in a first and a 
second said memory device, respectively, and said master device 
5 includes a means to write or read said block of data with error 
correction by sending separate ones of said request packets for 
said ECC information and for said corresponding block of data. 


46. A bus subsystem comprising 


10il 


a memory device and a master device connected in 


parallel on a bus, 


a means for said master device to send a request 


packet and initiate a bus transaction and 


a means for said master device to keep track of 


lif 


current and pending bus transactions, 


said bus including a plurality of bus lines for 


carrying substantially all address, data and control 
information needed by said memory devices, 


20 



said bus carrying device- select information without the 
need for separate device-select lines connected directly to 
individual devices on said bus, whereby said master device 


initiates bus transactions which transfer information 


25 


between devices on said bus and collisions on said bus are 
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avoided because said master device avoids initiating bus 
transactions which would conflict with current or pending 
bos transactions* 

47. The bus subsystem of claim 46 having at least two of 
said master devices and including 

a collision detecting means whereby a first said master 
device sending a first said request packet can detect a 
second said master device sending one of said colliding 
request packets, where one of said said colliding request 
packet may be sent simultaneous with the initial sending of 
or overlapping the sending of said first request packet, and 

an arbitration means whereby said first and said second 
master devices select a priority order in which each of said 
master devices will be allowed to access said bus 
sequentially* 

48 • The bus subsystem of claim 47 wherein each of said 
master devices has a master ID number and each of said request 
packets includes a master ID position which is a predetermined 
number of bits in a predetermined position in said request 
packet, and wherein said collision detection means comprises 

a means included in each master device for sending a 

request packet including said master ID number of said 
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raster device in said master ID position of said request 
packet and 

a means to detect a collision and invoke said 
arbitration means if any master device detects any other 
master ID number in said master ID position. 

49* The bus subsystem of claim 47 wherein each of said 
master devices includes 

a means for sending a request packet, 

a means for driving a selected bus line or lines during 
at least one selected bus cycle while said request packet is 
being sent, 

a means for monitoring said selected bus line or lines 
to see if a said master device is sending a colliding 
request packet and 

a means for informing all other master devices that a 
collision has occurred and for invoking said arbitration 
means. 

50. The bus subsystem of claim 47 wherein each of said 
master devices includes 

a means, when sending a request packet, to drive a 
selected bus line or lines with a certain current during at 
least one selected bus cycle, 
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a means for monitoring said selected bus line or lines 
for a greater than normal current to see if another master 
device is driving that line or lines, 

a means for detecting said greater than normal current, 

and 

a means for informing all said master devices that a 
collision has occurred and for invoking said arbitration 
means. 

51. The bus subsystem of claim 47 wherein said arbitration 
means comprises 

a means for initiating em arbitration cycle, 

a means for allocating a single bus line to each master 
device during at least one selected bus cycle relative to 
the start of said arbitration cycle, 

a means for allocating each master device to a single 
bus line during one of said selected bus cycles if there are 
more master devices than available bus lines, 

a means for each of said master devices which sent a 
colliding request packet to drive said bus line allocated to 
said master device during said selected bus cycle, and 

a means in at least one of said master devices for 
storing information about which master devices sent a 
colliding request packet, 

High Performance Bus Interface -79- 


/ 1 


whereby said master devices can monitor selected bus 
lines during said arbitration cycle and identify each said 
master device which sent a colliding request packet. 

52 ♦ The bus subsystem of claim 47 vherein said arbitration 
means comprises 

a means included in a first one of said master devices 
which sent colliding request packets for identifying each of 
said master devices which sent colliding request packets, 

a means for assigning a priority to each said master 
device which sent a colliding request packet, and 

a means for allowing each said master device which sent 
a colliding request packet to access the bus sequentially 
according to that priority. 

53. The bus subsystem of claim 52 wherein said priority is 
based on the physical location of each of said master devices. 

54. The bus subsystem of claim 52 wherein said priority is 
based on said master ID number of said master devices. 

55. The bus subsystem of claim 52 wherein each of said 
master devices includes a means, when sending a colliding request 
packet, for deciding which master device can send the next 
request packet in what order or at what time, whereby no master 

4 
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device aay send a new request packet until responses to each 
pending request packet have been completed or scheduled. 

56. A bus subsystem comprising 
5 a plurality of semiconductor devices connected in 

parallel to a bus, 

said bus including a plurality of bus lines for 
carrying substantially all address, data and control 
information needed by said semiconductor devices, 
lO^i said control information including device-select 

f Z information , 

>*•* ****** 

JJ; said bus containing substantially fewer lines than the 

number of bits in a single address, 
f said bus carrying said device-select information with- 

1«H out the need for separate device-select lines connected 

D directly to individual semiconductor devices, 

fn said semiconductor devices including a reset means 

having an input and an output, the output of the reset means 
of one semiconductor device being connected to the input of 
20 the reset means of the next semiconductor device in series. 

* 

57. The bus subsystem of claim 56 further including system 
reset means comprising 

a means for generating a first and a second reset 
25 signal, ~ 
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a «eans for passing said first reset signal to a first 
of said semiconductor devices and then to subsequent ones of 
said semiconductor devices in series and 

a means for passing a second reset signal to said first 
semiconductor device and then to said subsequent 
semiconductor devices in series, 
said bus subsystem including one of said semiconductor devices 
containing 

a device identification register adapted to contain a 
number unique to said semiconductor device within said bus 
subsystem, 

a device identification register setting means, and 
a device reset means for resetting said semiconductor 
device to some desired, known reset state in response to 
said first reset signal and for setting said device 
identification register in response to said second reset 
signal, 

whereby said bus subsystem can be reset to a known 
reset state with a unique device identification value in 
said device identification register of each of said 
semiconductor devices* 

58. Ihe bus subsystem of claim 57 wherein said desired, 
known reset state is where all registers in the semiconductor 
device are cleared and the state machines are reset. ~ - 
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59. The bus subsystem of claim 57 wherein said device 
identification register setting means comprises 

a means for detecting said second reset signal, 

a means for reading a device identification number from 

said bus line 8 at a specific time relative to said second 

reset signal and 

a means for storing said device identification number 

in said device identification register of said semiconductor 

device. 

60. The bus subsystem of claim 57 wherein said second reset 
signal comprises multiple pulse sequences and wherein said device 
identification setting means includes 

a means for interpreting said pulse sequences as a 
device identification number and 

a means for storing said device identification number 
in said device identification register of said semiconductor 
device. 

61. The bus subsystem of claim 57 wherein said device reset 
means comprises an ji-stage shift register capable of storing li- 
bit values, wherein said device reset means interprets a specific 
value in said shift register as said first reset signal and 
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interprets a specific value in said shift register as said second 
reset signal* 

62* The bus subsystem of claim 57 wherein one of said 
semiconductor devices is a master device, said master device 
including a means for generating said first and said second reset 
signals. 

63* The bus subsystem of claim 57 wherein one of said 
semiconductor devices is a master device, said master device 
including - - 

a master ID register, 

a means for assigning a master ID number to said master 
device and 

a means for storing said master ID number in said 
master ID register* 

64. The bus subsystem of claim 63 further comprising a 
second one of said master devices, and a means for a first one of 
said master devices to assign a master ID number to substantially 
all other said master devices, whereby said first master device 
assigns one of said master ID numbers to each of said master 
devices on said bus subsystem and each said master device stores 
said assigned master ID number in said master ID register. 

* * 
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65* The bus subsystem of claim 57 wherein one of said 
semiconductor devices includes a device -type register adapted to 
contain an identifier characteristic of that type of 
semiconductor device, and one or more modifiable registers , at 
5 least one of which is an access-time register adapted for storing 
access times. 

66. The bus subsystem of claim 65 wherein one of said 
semi conductor devices is a master device having 
10 a means for selecting a semiconductor device, 

in a means for reading said device-type register of said 

i selected semiconductor device, 

r* i V. 

3 a means for determining the device type of said 

y= selected semiconductor device, 

J£ a means for determining access -time values appropriate 

y for said selected semiconductor device and for storing said 

*« ' 'Jt 
■M Hi, 

ffi access -time values in said access-time registers of said 

selected semiconductor device, and 

a means for selecting and storing other values 
20 appropriate for said selected semiconductor device in 

corresponding registers of said selected semiconductor 

device , 

whereby said master device can select a semiconductor 
device, determine what type it is, and set said access-time 
25 and other registers to contain appropriate values* . 

4 

f 
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67. The bus subsystem of claim 66 further comprising a 
memory device having at least one discrete memory section and at 
least one modifiable address register adapted to store memory 
address information which corresponds to each of said discrete 
5 memory sections , and 

said master device further comprising a means for selecting 
and testing each of said discrete memory sections and a means for 
storing address Information in said address registers 
C3 corresponding to each of said discrete memory sections, whereby 
10 said master device can test all said discrete memory sections and 
m assign unique address values thereto. 

68* A bus subsystem comprising 
f* two semiconductor devices connected in parallel to a 

ft 

§$ bus, one of said semiconductor devices being a master 

tfl device, 

said bus including a plurality of bus data lines for 
carrying substantially all address, data and control 
information needed by said semiconductor devices, 

20 said control information including device-select 

information, 

said bus containing substantially fewer of said bus 
data lines than the number of bits in a single address, and 
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said bus carrying device-select information without the 
need for separate device-select lines connected directly to 
individual semiconductor devices, 

wherein all of said bus data lines are terminated 
transmission lines and all of said address, data and control 
information is carried on said bus data lines as a 
sequential series of bits in the form of low-voltage-swing 
signals • 

69. The bus subsystem of claim 68 further comprising a 
semiconductor device including a current-mode driver connected to 
drive one of said bus data lines ♦ 

70* The bus subsystem of claim 69 further comprising a 
semiconductor device having a means to measure the voltage of 
said low-voltage-swing signals on a selected one of said bus data 
lines, whereby said semiconductor device can determine whether 
zero, one, or more than one of said current-mode drivers are 
driving said selected bus data line. 

71. The bus subsystem of claim 70 further comprising a 
semiconductor device having 

a plurality of input receivers connected to one of said 
bus data lines, and 
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a selection means for selecting said input receivers 
one by one to sense and store, one at a time, the bits of 
said sequential series of bits* 

72. The bus subsystem of claim 70 further comprising a 
semi conductor device having two input receivers connected to one 
of said bus data lines* 

73* A bus subsystem comprising 

two semiconductor devices connected in parallel to a 
bus having a first and a second end, said bus including a 
bus clock line, said bus clock line having first and second 
ends corresponding to said first and second ends of said 
bus, respectively, 

a clock generator connected to said first end of said 
bus clock line to generate early bus clock signals with a 
normal rise time, and 

signal return means at said second end of said bus 
clock line to return said early bus clock signals to said 
first end of said bus as corresponding late bus clock 
signals , 

whereby each of said early bus clock signals will 
propagate from said clock generator along said clock line 
starting' from said first end to said second end of said bus 
and then return at a later time to said first end of said 

High Performance Bus Interface -88- 


bus as a corresponding late bus clock signal, whereby each 
semiconductor device on said bus can detect said early bus 
clock signals and said corresponding late bus clock signals. 

74. The bus subsystem of claim 73 further comprising a 
first and a second said bus clock line having first and second 
ends at said first and said second ends of said bus, 
respectively, wherein said signal return means directly connects 
said second ends of said first and said second bus clock lines 
whereby each of said early bus clock signals will propagate from 
said clock generator at said first end of said bus along said 
first bus clock line to said second end of said bus and then 
return on said second bus clock line to said first end of said 
bus as one of said corresponding late bus clock signals. 

75. The bus subsystem of claim 73 wherein said Bignal 
return means comprises said first bus clock line without a line 
terminator at said second end thereof whereby each of said early 
bus clock signals reaching said second end of said first bus 
clock line will be reflected back along said first bus clock line 
as said corresponding late bus clock signals. 


High Performance Bus Interface -89 



76* Ike bus subsystem of claim 73 further comprising 

a means for operating said bus in bus cycles timed to 
have a certain bus cycle frequency and a corresponding bus 
cycle period and 
5 a means for operating said clock generator with a 

period of twice the bus cycle period. 

77. The bus subsystem of claim 76 wherein said bus cycle 
frequency is greater than approximately 50 KHz and less than or 
100 equal to approximately 500 KHz. 

If 78. The bus subsystem of claim 73 further including a 

^0 semiconductor device having an internal device clock generating 

M means to derive the midpoint time between said early and 

lj|j corresponding late bus clock signals and to generate an internal 

,S device clock synchronized to said midpoint time* 

<* ** 

79. The bus subsystem of claim 73 further including a 
semiconductor device having a low-skew clock generator circuit 
20 comprising 

a first delay line having an input, an output and a 
basic delay and means for synchronizing the output of said 
first delay line with said early bus clock signal, 

a second delay line having said basic delay plus a 
25 variable delay, Baid second delay line having an output and 
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a means for synchronising the output of said second delay 
line with said late bus clock signal, and 

a third delay line having a third delay and a means to 
set said third delay midway between the delays of said first 
and second delay lines, said third delay line having an 
output which provides an internal device clock signal 
synchronised to a time halfway between said early and said 
late bus clock signals* 

80* The bus subsystem of claim 73 wherein said early and 
said late bus clock signals are low-voltage-swing signals that 
transition cyclically between low and high logical values, and 
further including a semiconductor device having a low-skew clock 
generator circuit comprising 

a DC amplifier to convert said early and said late bus 
clock signals into full-swing logic signals, 

a first variable delay line having a first variable 
delay and an input and an output, the input of said first 
variable delay line being connected to said DC amplifier 
a first, a second and a third additional delay line, 
each having an input and an output, the input of each of 
said additional delay lines being connected to the output of 
said first delay line, 

said first additional delay line having a fixed 

delay, 
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said second additional delay line having said 

fixed delay plus a second variable delay, and 

said third additional delay line having said fixed 

delay plus one half of said second variable delay, 

a first clocked input receiver connected to sample said 
early bus clock signal and gated by said output of said 
first additional delay line, 

a means for adjusting said first variable delay so said 
first clocked input receiver samples said early bus clock 
signal just as said early bus clock signal transitions, 

a second clocked input receiver connected to sample 
said late bus clock signal and gated by said output of said 
second additional delay line, 

a means for adjusting said second variable delay so 
said second clocked input receiver samples said late bus 
clock signal just as said late bus clock signal transitions, 

whereby said output of said third additional delay line 
is synchronized to a time halfway between said outputs of 
said first and said second additional delay lines, and said 
output of said third additional delay line provides an 
internal device clock signal. 

81. The bus subsystem of claim 80 further comprising a 
semiconductor device having 
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a first one of said low-skew clock generator 
circuits which generates a "true" internal device clock 
signal and 

a second one of said low-skew clock generator 
circuits connected to generate a "complement" internal 
device clock signal synchronized with but opposite in 
logical value to said "true" internal device clock 
signal. 

82. A DRAM device designed to be connected to an external 
bus having a plurality of bus lines for carrying substantially 
all address, data and control information needed by said DRAM 
device as a sequential series of bits, said control information 
including device-select information, said external bus containing 
substantially fewer said bus lines than the number of bits in a 
single address, and said bus carrying device-select information 
without the need for separate device-select lines connected 
directly to said DRAM device, said DRAM device comprising 

an array of memory cells connected in rows and columns, 

each of said memory cells adapted to store one of said bits, 
a row address selection means for selecting one of said 

rows, 

a column sense amp connected to each of said columns, 
each of said column sense amps adapted to latch one of said 
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bits as a binary logical value or to precharge to a selected 
state, 

a column decoding means connected to each of said 
column sense amps for selecting a plurality of said column 
sense amps for inputting one of said bits to or outputting 
one of said bits from said memory cells, 

an internal I/O bus having a plurality of internal I/O 
lines wherein each of said internal I/O lines is connected 
to a plurality of said column sense amps, and 

a plurality of bus connection means designed to connect 
said internal I/O lines to said external bus, 

whereby a selected bit of said sequential series of 
bits can be transferred from said external bus to a selected 
one of said memory cells or said bit contained in a selected 
one of said memory cells can be transferred to said external 
bus. 

83 • The DRAM device of claim 82 further comprising 

an output driver connected to one said bus connection 
means, 

an output multiplexer having an output connected to 
said output driver and a plurality of inputs, each of said 
inputs being connected to one of said internal I/O lines, 
and 
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a control means to select whether said output driver 
can drive said external bus, 

whereby a plurality of memory cells are selected using 
said row address selection means and said column decoding 
means and a plurality of bits contained in said plurality of 
memory cells are output through said column sense amps to 
said internal I/O bus to said output multiplexer to said 
output driver to said external bus. 

84. The DRAM device of claim 82 further comprising 

a plurality of input receivers connected to one of said 
bus data lines and to said internal I/O bus, 

a selection means for selecting said input receivers 
one by one to sense and 6 tore , one at a time, the bits of 
said sequential series of bits, and 

a control means to select whether an input receiver can 
drive said internal I/O bus, whereby a bit of said 
sequential series of bits is input from said external bus 
through one of said input receivers to one of said internal 
I/O lines to one of said column sense amps to one of said 
memory cells. 
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85. The DRAM device of claim 82 further comprising 

a first and a second half -array of said memory cells 
wherein each said row of said array of said memory cells is 
subdivided into two parts, 

a first and a second one of said internal I/O buses 
connected to said column sense amps in said first and said 
second ha If -arrays, respectively, and 

a column decoder means to gate selected ones of said 
column sense amps connected to said memory cells in a 
selected row of said first and said second half -arrays 
s imul taneous ly . 

86. The DRAM device of claim 85 wherein said column decoder 
means selects sixteen column sense amps at a time. _ 

87. The DRAM device of claim 82 wherein said external bus 
operates at a certain speed and wherein said DRAM device includes 
four of said internal I/O buses, each of which operates at one- 
fourth the speed of said external bus* 

88. The DRAM device of claim 82 further comprising 

a means for precharging one of said column sense amps 
to a precharged state from which a binary logical value can 
quickly be loaded into said column sense amp, 
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If said column sense amp contains a binary logical 
value, a means for latching the logical value currently 
contained in said column sense amp and 

a means for instructing said DRAM device to precharge 
said column sense amp or latch said binary logical value in 
said column sense amp. 

89. The DRAM device of claim 88 further comprising a means 
for instructing said DRAM device to precharge said column sense 
amp without further instruction whenever said row address 
selection means selects a different one of said rows* 

90* The DRAM device of claim 88 further comprising a means 
for instructing said DRAM device to precharge said column sense 
amp without further instruction at a first or a second 
preselected time after latching the latest said binary logical 
value , said first preselected time being long enough for said 
DRAM to latch said binary logical value into said column sense 
amp and transfer said binary logical value into memory or onto 
one of said internal I/O lines, and said second preselected time 
being a variable which can be stored in said DRAM device whereby 
said DRAM can latch a binary logical value into said column sense 
amp for transferring said binary logical value into or out of a 
selected said memory cell, then precharge to allow a faster 
subsequent read or write. 
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91. A package containing 

a semiconductor die having a aide, circuitry and a 
plurality of connecting areas positioned along or near said 
side, spaced at a selected pitch and connected to said 
circuitry, 

said package comprising a plurality of bus connecting 
means for connecting to a plurality of external bus lines, 
each of said external bus lines corresponding to one of said 
connecting areas, each of said bus connecting means being 

positioned on a first side of said package, 
connected to one said external bus line and to 
said corresponding connecting area on said 
semiconductor die, and 

spaced at a pitch substantially identical to said 
selected pitch of said connecting areas, 
whereby each of said external bus lines can be 
connected to said corresponding connecting area on said 
semiconductor die by bus connection means positioned along a 
single side of said package. 

92. The package of claim 91 further comprising a plurality 
of said bus connecting means wherein each of said bus connecting 

means includes 

a pin adapted ^f or connection to one of said external 

bus lines and 
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a wire connecting said pin to one of said connecting 
areas on said semiconductor die, 

T 

said wire having an effective lead length less than about 4 
millimeters and wherein the effective lead length of said wire of 
5 each of said bus connection means for said package is 
approximately equal. 

93. A plurality of packages of claim 91 wherein at least 
two of said semiconductor die are memory devices, each of said 

10 o P&ckages being generally flat, having a top and a bottom, and 
il wherein 

said packages are physically secured adjacent and parallel 
:Z S to each other in a stack, 

where a first one of said packages is adjacent to a second 
15 one of said packages in said stack, said top of said first 

[U package is substantially aligned with said bottom of said second 

nil*' 

tfl package, and 

HI "*^- 

said bus connecting means of each of said packages are 
substantially aligned and are lying substantially in a plane. 

20 

94. The plurality of packages of claim 93 further 
comprising a plurality of stacks wherein each of said bus con- 
necting means can be electrically connected to corresponding said 
bus connecting means in each of said stacks. 

25 
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95. A semiconductor device capable of use In a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address, 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semiconductor device connected to said bus, and has 
substantially fever bus lines than the number of bits in a single 
address, and carries device-select information for said 
semiconductor device without the need for a separate device- 
select line connected directly to said individual semiconductor 
device, said semiconductor device comprising 

connection means adapted to connect said semiconductor 
device to said bus, and 

at least one modifiable identification register 
accessible to said bus through said connection means, 
whereby data may be transmitted to said register via said 
bus and enable said device thereafter to be uniquely 
identified. 

96. The semiconductor device of claim 95 wherein said 
semiconductor device is a memory device which connects 
substantially only to said bus and sends and receives 
substantially all address, data and control information over said 
bus. 
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97. A semiconductor device capable of use in a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address, 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semiconductor device connected to said bus, and has 
substantially fewer bus lines than the number of bits in a single 
address, and carries device-select information for said 
semiconductor device without the need for a separate device- 
select- line connected directly to said individual semiconductor 
device, said semiconductor device comprising 

connection means adapted to connect said semiconductor 

device to said bus, and 

at least one modifiable register to hold device address 
information, said modifiable register accessible to 
said bus through said connection means, whereby data 
may be transmitted to said register via said bus which 
enables said device thereafter to respond to a 
predetermined range of addresses. 

98. The semiconductor device of claim 97 wherein said 
semiconductor device is a memory device which connects 
substantially only to said bus and sends and receives 

> 
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substantially all address, data and control information over said 
bus* 

99. The semiconductor device of claim 98 wherein said 
memory device has at least one discrete memory section and also 
has at least one modifiable address register adapted to store 
memory address information which corresponds to each said 
discrete memory section. 

100. The semiconductor device of claim 99 wherein said 
memory address information comprises a pointer to said discrete 
memory section. 

* 

101. The semiconductor device of claim 100 wherein said, 
discrete memory section has a top and a bottom and said memory 
address information comprises pointers to said top and said 
bottom. 

102. The semiconductor device of claim 100 wherein said 
memory address information comprises 

a pointer to said discrete memory section and 
a range value indicating the size of said discrete 
memory section. 
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103 * A semiconductor device capable of use in a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address, 
data and control information needed by said semiconductor device 
for communication with substantially every other semiconductor 
device connected to said bus, and has substantially fewer bus 
lines than the number of bits in a single address, said 
semiconductor device comprising 

connection means adapted to connect said semiconductor 

device to said bus, and 

at least one modifiable access-time register accessible 
to said bus through said connection means, whereby data may 
be transmitted to said register via said bus which 
establishes a predetermined amount of time that said 
semiconductor device thereafter must wait before using said 
bus in response to a request* 

104, The semiconductor device of claim 103 wherein said 
semiconductor device is a memory device which connects 
substantially only to said bus and sends and receives 
substantially all address, data and control information over said 
bus* 
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105. The semiconductor device of claim 103 further 
comprising at least two access-time registers and one of said 
access -time registers is permanently programmed to contain a 
fixed value and at least one of said access-time registers can be 
modified by information carried on said bus* 

106 « A semiconductor device capable of use in a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address , 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semiconductor device connected to said bus, and has 
substantially fewer bus lines than the number of bits in a single 
address, and carries device-select information for said 
semiconductor device without the need for a separate device- 
select line connected directly to said individual semiconductor 
device, and wherein each said bus line is a terminated 
transmission line, said semiconductor device comprising 

connection means adapted to connect said semiconductor 
device to said bus, and 

a bus line driver capable of producing a low-voltage- 
swing signal on one of said terminated transmission lines. 
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107. The semiconductor device of claim 106 wherein said 
semiconductor device is a memory device which connects 
substantially only to said bus and sends and receives 
substantially all address, data and control information over said 
bus. 

108. A semiconductor device capable of use in a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address, 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semiconductor device connected to said bus, and has 
substantially fewer bus lines than the number of bits in a single 
address, and carries device -select information for said 
semiconductor device without the need for a separate device- 
select line connected directly to said individual semiconductor 
device, said bus further including at least one bus clock line 
for carrying early and late bus clock signals, said semiconductor 
device comprising 

connection means adapted to connect said semiconductor 
device to said bus, and 

an internal device clock generating means which 
generates an internal device clock synchronised to a time 
halfway between said early and said late bus clock signals. 
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109. The semiconductor device of claim 108 wherein said bus 
further includes a first and a second one of said bus clock 
lines, said first bus clock line carries said early bus clock 
signal and said second bus clock line carries said late bus clock 
5 signal, said semiconductor device further comprising a means to 
detect said early bus clock signal on said first bus clock line 
and a means to detect said late bus clock signal on said second 
bus clock line. 

10C3 110. The semiconductor device of claim 109 wherein said 

semiconductor device is a memory device which connects 
yi substantially only to said bus and sends and receives 
Iq substantially all address, data and control information over said 

™ bUS r 

IS 111. A semiconductor device capable of use in a semi- 

conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying as a sequential series of 

20 bits substantially all address, data, control and device-select 

information needed by said semiconductor device for communication 
with substantially every other semiconductor device connected to 
said bus, and has substantially fewer bus lines than the number 
of bits in a single address, and carries device-select 

25 information for said semiconductor device without the need for a 
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separate device-select line connected directly to said individual 
semiconductor device , said semiconductor device comprising 

connection means adapted to connect said semiconductor 

device to said bus, 

a plurality of input receivers connected to one of said 

bus data lines and 

a selection means for selecting said input receivers 
one by one to sense and store, one at a time, the bits of 
said sequential series of bits. 

112. The semiconductor device of claim 111 wherein said 
semiconductor device is a memory device which connects 
substantially only to said bus and sends and receives 
substantially all address, data and control information over said 

bus. 

113. The semiconductor device of claim 112 wherein two input 
receivers are connected to one of said bus lines. 

114. A semiconductor device capable of use in an 
architecture for a semiconductor system bus including a plurality 
of semiconductor devices connected in parallel to a bus wherein 
said bus system includes a plurality of bus lines for carrying 
substantially all address, data, control and device-select 
information needed by said semiconductor device for communication 
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with substantially every other semiconductor device connected to 
said system bus, and has substantially fewer bus lines than the 
number of bits in a single address, and carries device-select 
Information for said semiconductor device without the need for a 
separate device-select line connected directly to said individual 
semiconductor device, said semiconductor device comprising 

connection means adapted to connect said semiconductor 
device to said system bus, 

an internal input/output bus within said semiconductor 
device having more lines than said system bus, and 

a means for multiplexing the lines of said internal bus 
to the lines of said system bus, whereby said system bus can 
run at a higher speed than said internal bus. 

115. The semiconductor device of claim 114 wherein said 
semiconductor device is a memory device which connects 
substantially only to said system bus and sends and receives 
substantially all address, data and control information over said 
system bus. 

116. A semiconductor device capable of use in an 
architecture for a semiconductor system bus including a plurality 
of semiconductor devices connected in parallel to a bus wherein 
said system bus includes a plurality of bus lines for carrying 
substantially all address, data, control and device-select 
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information needed by said semiconductor device for communication 
with substantially every other semiconductor device connected to 
said system bus, and has substantially fewer bus lines than the 
number of bits in a single address, and carries device-select 
information for said semiconductor device without the need for a 
separate device-select line connected directly to said individual 
semiconductor device, said semiconductor device comprising 

connection means adapted to connect said semiconductor 
device to said system bus, 

an internal input/output bus within said semiconductor 
device having more lines than said system bus, 

a means for multiplexing the lines of said internal bus 
to the lines of said system bus, whereby said system bus can 
run at a higher speed than said internal bus, and 

at least one modifiable identification register 
accessible to said system bus through said connection means, 
whereby data may be transmitted to said register via said 
system bus and which enables said device thereafter to be 
uniquely identified* 

117. The semiconductor device of claim 116 wherein said 
semiconductor device is a memory device which connects 
substantially only to said system bus and sends and receives 
substantially all address, data and control information over said 
system bus* 
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118* A semiconductor device capable of use in an 
architecture for a semiconductor system bus including a plurality 
of semiconductor devices connected in parallel to a bus wherein 
said system bus includes a plurality of bus lines for carrying 
substantially all address, data, control and device-select 
information needed by said semiconductor device for communication 
with substantially every other semiconductor device connected to 
said system bus, and has substantially fewer bus lines than the 
number of bits in a single address, and carries device-select 
information for said semiconductor device without the need for a 
separate device-select line connected directly to said individual 
semiconductor device, said semiconductor device comprising 

connection means adapted to connect said semiconductor 
device to said system bus, 

an internal input/output bus within said semiconductor 
device having more lines than said system bus, 

a means for multiplexing the lines of said internal bus 
to the lines of said system bus, whereby said system bus can 
run at a higher speed than said internal bus, and 

at least one modifiable register to hold device address 
information, said modifiable register accessible to said 
system bus through said connection means, whereby data may 
be transmitted to said register via said system bus which 
enables said device thereafter to respond to a predetermined 
range of addresses. 
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119. The semiconductor device of claim 118 wherein said 
semiconductor device is a memory device which connects 
substantially only to said system bus and sends and receives 
substantially all address, data and control information over said 
system bus. 

120. The semiconductor device of claim 119 wherein said 
memory device has at least one discrete memory section and also 
has at least one modifiable address register adapted to store 
memory address information which corresponds to each said 
discrete memory section* 

121* A semiconductor device capable of use in an 
architecture for a semiconductor system bus including a plurality 
of semiconductor devices connected in parallel to a bus wherein 
said system bus includes a plurality of bus lines for carrying 
substantially all address, data and control information needed by 
said semiconductor device for communication with substantially 
every other semiconductor device connected to said system bus, 
and has substantially fewer bus lines than the number of bits in 
a single address, said semiconductor device comprising 

connection means adapted to connect said semiconductor 

device to said system bus, 

an internal input/output bus within said semiconductor 

device having more lines than said system bus, 
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a means for multiplexing the lines of said internal bus 
to the lines of said system bus, whereby said system bus can 
run at a higher speed than said internal bus, and 

at least one modifiable access-time register accessible 
to said system bus through said connection means, whereby 
data may be transmitted to said register via said system bus 
which establishes a predetermined amount of time that said 
semiconductor device thereafter must wait before using said 
system bus in response to a request. 

122. The semiconductor device of claim 121 wherein said 
semiconductor device is a memory device which connects 
substantially only to said system bus and sends and receives 
substantially all address, data and control information over said 
system bus. 

123. The semiconductor device of claim 121 further 
comprising at least two access-time registers and one of said 
access-time registers is permanently programmed to contain a 
fixed value and at least one of said access-time registers can be 
modified by information carried on said system bus. 

124. A semiconductor device capable of use in a semi- 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
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e plurality of bus lines for carrying substantially all address, 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semiconductor device connected to said bus, and has 
substantially fewer bus lines than the number of bits In a single 
address, and carries device-select information for said 
semiconductor device without the need for a separate device* 
select line connected directly to said individual semiconductor 
device, wherein said address, data, control and device-select 
information is carried over said bus in the form of request 
packets and bus transactions, said semi conductor device 
comprising 

connection means adapted to connect said semiconductor 
device to said bus, 

a means to receive said request packets over said bus, 
a means to decode information in said request packets, 

and 

a means to respond to said information in said request 
packets • 

125. The semi conductor device of claim 124 wherein said 
means to decode information in said request packet further 
comprises 

a means to identify and decode said control information 
in said request packet, 
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a means to Identify and decode said device-select 
information in said request packet, r . . 

* ■ 

a means to identify and decode said address information 
in said request packet and 

a means to determine whether said control information 
or said address information instructs said semiconductor 
device to begin a response* 

126. The semiconductor device of claim 124 wherein each of 
said bus transactions is carried out in response to said address 
and said control information in one of said request packets and 
wherein said means to identify and decode information in said 
request packets includes a means to identify a sequence of bytes 
on said bus as one of said request packets containing said — 
address and said control information, said control information 
including information about the type of said bus transaction 
being requested and the access time which needs to intervene 
before beginning said bus transaction over said bus and said 
address and said control information includes device-select 
information instructing one or more said semiconductor devices to 
respond to said address and said control information* 

127. The semiconductor device of claim 124 further 
comprising 
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a plurality of sense amplifiers adapted to precharge to 
a selected state or to latch a bit of information , 

a means to hold said sense amplifiers in an unmodified 
state after latching one of said bits of information, 

a means to precharge said sense amplifiers and 

a means for selecting whether said semiconductor device 
should precharge said sense amplifiers or should hold said 
sense amplifiers in an unmodified state. 

128. The semiconductor device of claim 124 wherein said 
means to respond to said information/ where said information is 
control information, further comprises a means to 

transfer a data block during a data block transfer, 
further including a means to 

read data from said semiconductor device and 
write data into said semiconductor device, and 
initiate a data block transfer, 
transfer a data block of a selected size, 
transfer a data block at a selected time, 
access a control register, including a means to read 
from or write to said control register, or 
select normal or page-mode access. 

129 • The semiconductor device of claim 124 further 
comprising a means to respond to said information in said request 
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packet if said information includes a device identification 
number unique to said semiconductor device* 

130. The semiconductor device of claim 124 further 
comprising a means to respond to said information in said request 
packet if said information includes a special device 
identification number which calls for said semiconductor device 
to respond. 

131. The semiconductor device of claim 124 further 
comprising a means to respond to said information in said request 
packet if said information includes an address unique to said 
semiconductor device. 

132. The semiconductor device of claim 124 further 
comprising a means to interpret said control information and 
decode the time to wait before beginning said bus transaction 
over said bus. 

133. The semiconductor device of claim 124 further 
comprising a means to interpret said control information and 
decode the size of a data block to transfer during one of said 
bus transactions. 

t 

* 
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134. The semiconductor device of claim 124, 125 , 126, 127 , 
128, 129, 130, 131, 132 or 133 wherein said semiconductor device 
is a memory device which connects substantially only to said bus 
and sends and receives substantially all address, data and 
control information over said bus. 

135. A semiconductor device capable of use in a semi* 
conductor bus architecture including a plurality of semiconductor 
devices connected in parallel to a bus wherein said bus includes 
a plurality of bus lines for carrying substantially all address, 
data, control and device-select information needed by said 
semiconductor device for communication with substantially every 
other semi conductor device connected to said bus, and has 
substantially fewer bus lines than the number of bits in a single 
address, and carries device-select information for said 
semiconductor device without the need for a separate device- 
select line connected directly to said individual semiconductor 
device, wherein said address, data, control and device-select 
information is carried over said bus in the form of request 
packets and bus transactions, said semiconductor device 
comprising 

connection means adapted to connect said semiconductor 
device to said bus, 

a means to encode address and control Information in 
said request packets and 
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a means to send said request packets over said bus* 

* 

136* The semiconductor device of claim 135 further 
comprising a means to request a bus transaction wherein each of 
said bus transactions is carried out in response to said address 
and said control information in one of said request packets, and 
wherein said means to encode information in said request packets 
includes a means to mark a sequence of bytes on said bus as one 
of said request packets, said control information including 
information about the type of said bus transaction being 
requested and the access time which needs to intervene before 
beginning said bus transaction over said bus and said address and 
said control information includes device-select information 
instructing one or more said semiconductor devices to respond to 
said address and said control information. 

137. The semiconductor device of claim 135 wherein one or 
more of said plurality of semiconductor devices has a unique 
device identification number, said semiconductor device further 
comprising a means to send control information to a specific one 
of said plurality of semiconductor devices by including in said 
request packet a selected said device identification number. 

138. The semiconductor device of claim 135 wherein each of 
said plurality of semiconductor devices is adapted to respond to 
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a special device identification number, said semiconductor device 
further comprising a means to send control information to each of 
said plurality of semiconductor devices by including in said 
request packet said special device identification number. 

5 

139 « The semiconductor device of claim 135 wherein one or 
more of said plurality of semiconductor devices is a memory 
device having a plurality of addresses, said semiconductor device 
further comprising a means to send control Information to a 
10D specific address or range of addresses in one of said plurality 
q of semiconductor devices by including said specific address or 
t range of addresses in said request packet* 

u ; 140. The semiconductor device of claim 135 wherein at least 

15 one of said request packets is a request packet requesting a bus 
S J: transaction which is followed by a corresponding one of said bus 
m transactions, said semiconductor device further comprising a 

means to encode said control information to specify directly or 

+ 

indirectly the time between the end of said request packet 
20 requesting a bus transaction and said corresponding bus 
transaction over said bus. 

141. The semiconductor device of claim 140 wherein one type 
of said bus transactions is a transfer of a data block, said 
25 semiconductor device further comprising a means to encode said 
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control Information to specify the size of said data block to 
transfer. *" . 

142. The semiconductor device of claim 140 further 
comprising a means to keep track of current and pending bus 
transactions, whereby collisions on said bus are avoided because 
said semiconductor device avoids initiating bus transactions 
vhich would conflict with current or pending bus transactions. 

143. The semiconductor device of claim 135 wherein said 
semiconductor device is a first master device and one of said 
plurality of semiconductor devices is a second master device , 
further comprising 

a collision detecting means whereby said first master 
device when sending a first one of said request packets can 
detect said second master device sending a colliding one of 
said request packets, where said colliding request packet 
may be sent simultaneous with the initial sending of or 
overlapping the sending of said first request packet, and 

an arbitration means whereby said first and said second 
master devices select a priority order in which each of said 
master devices will be allowed to access said bus 
sequentially. 
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144* The semiconductor device of claim 143 Wherein said 
semiconductor device ie a master device and at least one of said 
plurality of semiconductor devices is a master device, each of 
said master devices has a master ZD number and each of said 
request packets includes a master ID position which is a 
predetermined number of bits in a predetermined position in said 
request packet, and wherein said collision detection means 
comprises 

a means for said semiconductor device to send its 
master ID number in said request packet and 

a means to detect a collision and invoke said 
arbitration means if said semiconductor device detects any 
other master ID number in said master ID position. 

145. The semiconductor device of claim 144 wherein said 
system bus architecture includes a means for carrying information 
on said bus during bus cycles, said semiconductor device further 
comprising 

a means for driving a selected bus line or lines during 
at least one selected bus cycle while sending each said 
request packet, 

a means for monitoring said selected bus line or lines 
to see if another said master device is sending one of said 
colliding request packets and 

< 

High Performance Bus Interface -121- 


a means for informing all said Blaster devices that a 
collision has occurred and for invoking said arbitration 
neans. 

146. The semiconductor device of claim 145 further 
comprising 

a means , when sending a request packet, for driving a 
selected bus line or lines with a certain current during at 
least one selected bus cycle, 

a means for monitoring said selected bus line or lines 
for a greater than normal current to see if another said 
master device is driving that line or lines, 

a means for detecting said greater than normal current/ 

and 

a means for informing all said master devices that a 
collision has occurred and for invoking said arbitration 
means ♦ 

147. The semiconductor device of claim 143 wherein said 
arbitration means comprises 

a means for initiating an arbitration cycle, 
a means for allocating a single bus line to each said 
master device during at least one selected bus cycle 
relative to the start of said arbitration cycle. 
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a means for allocating each said master device to a 
•ingle bus line during one of said selected bus cycles if 
there are more master devices than available bus lines, 

a means for each of said master devices which sent one 
of said colliding request packets to drive said bus line 
allocated to said master device during said selected bus 
cycle, and 

a means in at least one of said master devices for 
storing information about which master devices sent one of 
said colliding request packets, 

whereby said master devices can monitor selected bus 
lines during said arbitration cycle and identify each said 
master device which sent one of said colliding request 
packets ♦ 

148% The semiconductor device of claim 143 wherein said 
arbitration means comprises 

a means for identifying each of said master devices 
which sent one of said colliding request packets, 

a means for assigning a priority to each said master 
device which sent one of said colliding request packets, and 

a means for allowing each said master device which sent 
one of said colliding request packets to access the bus 
sequentially according to that priority* 
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149* The gem [conductor device of claim 143 wherein said 
priority is based on the physical location of each of said master 
devices . 

5 150* The semiconductor device of claim 143 wherein said 

priority is based on said master ID number of said master 
devices . 
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The present invention includes a memory subsystem comprising 
at least two semiconductor devices, including at least one memory 
device, connected to a bus, where the bus includes a plurality of 
bus lines for carrying substantially all address, data and 
control information needed by said memory devices, where the 
control information includes device-select information and the 
bus has substantially fewer bus lines than the number of bits in 
a single address, and the bus carries device-select information 
without the need for separate device-select lines connected 
directly to individual devices. 

The present invention also includes a protocol for master 
and slave devices to communicate on the bus and for registers in 
each device to differentiate each device and allow bus requests 
to be directed to a single or to all devices • The present 
invention includes modifications to prior-art devices to allow 
them to implement the new features of this invention. In a 
preferred implementation, 8 bus data lines and an AddressValid 
bus line carry address, data and control information for memory 
addresses up to 40 bits wide. 


High Performance Bus Interface -125 


^-^^^-^^^ OFFICE 

(Case No. P043D2C2) 


In the Application of: 

FARMWALD ET AL 


Ser 


ial No: Continuation of 08/798,520 


Filed: NOVEMBER 20, 1998 

Title: INTEGRATED CIRCUIT I/O USING A 
HIGH PERFORMANCE BUS INTERFACE 


Assistant Commissioner for Patents 
Washington, DC 20231 


Group 
Art Unit: 

Before 
Examiner : 


REQUEST TO APPROVE DRAWING CHANGES 


Dear Sir: 


Applicants seek to amend Figure 10 to more fully reflect the 
discussion in the specification, specifically, page 55, lines 12-16 

*ft lines 13-23 Attached is a photocopy of Figure 10 
and page 58, lines u ^ • 

■ ^-i^ori -in red No new matter has been 
with the proposed changes indicated m rea. 


added . 


applicants respectfully request approval o£ the proposed 
changes . A new Figure 10 which incorporates the changes is also 


attached to hereto . 
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Date: November 20, 1998 
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DECLARATION AND POWER OF ATTORNE Y FOR PATENT APPLICATION 


As a below named inventor, X hereby declare that: 

My residence, post office and citizenship are as stated below, 
next to my name. 

I believe Z am the original, first and sole inventor (if only one 
name is listed below) or an original, first and joint inventor 
(if plural names are listed below) of the subject matter which is 
claimed and for which a patent is sought on the invention enti- 
tled INTEGRATED CIRCUIT I/O USING A HIGH PERFORMANCE BUS 
INTERFACE the specification of which 

XX is attached hereto. 

was filed on as 

Application Serial No. 

and was amended on 

(if applicable) 

I hereby state that I have reviewed and understand the contents 
of the above-identified specification, including the claims, as 
amended by any amendment referred to above. I do not know and do 
not believe that the same was ever known or used in the United 
States of America before my invention thereof, or patented or 
described in any printed publication in any country before my 
invention thereof or more than one year prior to this applica- 
tion, that the same was not in public use or on sale in the 
United States of America more than one year prior to this appli- 
cation/ and that the invention has not been patented or made the 
subject of an inventor's certificate issued before the date of 
this application in any country foreign to the United States of 
America on an application filed by me or my legal representatives 
or assigns more than twelve months prior to this application. 

I acknowledge the duty to disclose information which is material 
to the examination of this application in accordance with Title 
37, Code of Federal Regulations, Section 1.56(a). 

I hereby claim the benefit under Title "35, United States Code, 
Section 120 of any United States application ( s ) listed below and, 
insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States appli- 
cation in the manner provided by the first paragraph of Title 35, 
United States Code, Section 112, I acknowledge the duty to dis- 
close material information as defined in Title 37, Code of 


0 


0 


Federal Regulations, Section 1.56 (a) which occurred between the 
filing date of the prior application and the national or PCT 
international filing date of this application t 


(Application Serial No.) (Filing Date) 


( Status -patented 
pending, abandoned) 


(Application Serial No.) (Filing Date) 


( Status -patented 
pending, abandoned) 


Z hereby appoint Roger S. Borovoy, Reg. No. 20,193, and David J. 
Larwood, Reg. No. 33,191, 600 Hansen Way, Suite 100, Palo Alto, 
California 94306, telephone (415) 856-9411, my attorneys with 
full power of substitution and revocation, to prosecute this 
application and to transact all business in the Patent and 
Trademark Office connected herewith. Please address all corres- 
pondence to Hr. Larwood. 

X hereby declare that all statements made herein of my own know- 
ledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 


Full Name of Sole/First Inventors: Michael Farmwald 

Residence 82 Eucalyptus Rd.. Citizenshi p U.S. A, 

Berkeley, California 94705 (Country) 
(City, State) 

Inventor's Signature ^^jJ?.v»\i cJL^^^^aJ Dat e ty »* 


Full Name of Joint/Second Inventor* Mark Horowits 


Residence 2024 Columbia Street Citizenshi p O.S.A« 

Palo Alto, California 94306 A (Country) 


Inventor 


Dat a A^l 17, MP 


Attorney's Docket No.: 73305.P001 — 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 


Patent 


In Re Application of: 

Minhaei F fir™ walri and Mari< Horowitz 
(Inventors(s)) 

Serial No.: 07/510.898 

Filed: April 1 a 1990 

For. INTEGRATED Cr IR ™ IIT lJSING 
A , HIGH PFRFQ RMANCE BUS 


(Title) 


Commissioner of 
Patents and Trademarks 
Washington, D.C. 20231 


Examiner. 


Group Art Unit: 238 


POWER OF ATTORNEY BY ASSIGNEE 
A^n RFVOOATIQ N ^F PRFV ini IS POWERS 


("assignee"), a California 


(state of incorporation) 
24S5_Lalham Str 


(Name of Assignee) 

corporation having a place of 



business at - . 

(address) 

represents that it is the assignee of the entire right, title, and interest in and to 
the above-referenced patent application and that the undersigned Is a 
representative authorized to sign on behalf of the assignee. 

Pursuant to 37 C.F.R. §§ 1.32 and 1.36, the assignee hereby revokes all 
powers of attorney previously given and appoints Bradley J. Bereznak. Reg. No. 
33,474; Roger W. Blakely, Jr., Reg. No. 25,831 ; Jeffrey Jay Blatt, Reg. No. 
30,244; Vernon Randall Gard. Reg. No. 33.886; Stephen D. Gross. Reg. No. 
31,020; David R. Hatvorson, Reg. No. 33,395; George W. Hoover, Reg. No. 
32,992; Michael Hurey. Reg. No. 33.513; Tracy L Hurt, Reg. No. 34,188; 

Eric S. Hyman. Reg. No. 30,139; Stephen L King, Reg. No. 19,180; Maria E. 

McCormack, Reg. No. 31 ,639; James D. McFarland, Reg. No. 32,544; Ronald 

W. Reagin, Reg. No. 20,340; James C. Scheller, Reg. No. 31.195; Ira M. 

Siegel, Reg. No. 28,907; Stanley W. Sokoloff, Reg. No. 25.128; 


Edwin H. Taylor, Reg. No. 25,129; Lester J. Vincent, Reg. No. 31,460; and 
Norman Zafman, Reg. No. 26,250; as Its attorneys; and Keith Q. Askoff, Reg. No. 
33,828, as its patent agent; of BLAKELY, SOKOLOFF, TAYLOR & ZAFMAN, with 
offices located at 12400 Wilshire Boulevard, 7th Floor, Los Angeles, California 
90025, telephone (213) 207-3800, with full power of substitution and 
revocation, to prosecute this application and to transact all business in the 
Patent and Trademark Office connected herewith. 

Pursuant to 37 C.F.R. § 1.32, the assignee hereby states that prosecution 
of the above-referenced patent application is to be conducted to the exclusion 
of the inventor(s). 

Send all future correspondence to Lester J. Vincent , 

Reg. No. 31 460 Blakely, Sokoloff, Taylor, & Zafman, 12400 Wilshire 
Boulevard, Seventh Floor, Los Angeles, California 90025, and direct all 
telephone calls to the same at (408) 720-8598. 


Assignee of Interest: RAMBUS INC. 



By:. 



(Type or Print) 


Name: Geoff Tate 

(Type or Print) 


Title: President and Chief Executive Officer 

(Type or Print) 


Address of Assignee of Interest: 

2465 Latham Street 


Mountain View. California 94040 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

(Case No. P043D2C2) 


In the Application of: ) 


FARMWALD ET AL . ) 

Group 


Art Unit: 

Serial No: Continuation of 08/798,520 ) 



Before 

Filed: Herewith ) 

Examiner : 

Title: INTEGRATED CIRCUIT I/O USING A ) 


HIGH PERFORMANCE BUS INTERFACE ) 



Washington, DC 2 0231 


POWER OF ATTORNEY BY ASSIGNEE, 
REVOCATION OF ALL PRIOR POWERS OF ATTORNEY 

AND 

CERTIFICATE UNDER 37 CFR 3.73(b) 

S i r z 

The undersigned, being empowered to sign this Power of 
Attorney, Revocation of All Previous Powers of Attorney and 
Certificate under 3 7 CFR 3.73(b) on behalf of Rambus, Inc., the 
assignee of the entire right, title and interest in the above- 
referenced application, hereby revokes all prior powers of attorney 
and hereby appoints Neil A. Steinberg, Reg. No. 34, 735 r with full 
power of substitution and revocation to prosecute this application 
and to transact all business before the United States Patent and 
Trademark Office in the above-referenced application. 

Rambus, Inc., formerly a California corporation with a place 
of business at 4920A El Camino Real, Los Altos, California 94022, 
certifies that it is the assignee of the entire right, title and 
interest in the above -referenced patent application by virtue of an 
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assignment from the inventors, Michael Farmwald and Mark Horowitz. 
The assignment of the parent patent application (application serial 
no. 07/510,898) and all continuing and divisional application 
thereof to Rambus Inc. was filed on April 18, 1990 and recorded in 
the U.S. Patent and Trademark Office at Reel 5385, Frame $75 . 

The undersigned has reviewed all the documents in the chain of 
title of the above-referenced application and, to the best of the 
undersigned ! s knowledge and belief, title is in Rambus, Inc., the 

assignee identified above. 

Please direct all correspondence in the above -referenced 

patent application to: 


I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on information and 
believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so 
make are punishable by fine or imprisonment, or both, under Section 
1001 of Title 18 of the United States Code, and that such willful 
false statements may jeopardize the validity of the application, 
any patent issuing thereon. 


Neil A. Steinberg, Esq. 
5827 Osceola Road 


Bethesda, Maryland 20816 
Telephone: 301-229-7706 
Facsimile: 301-229-5882 




Vice President 

Intellectual Property 
Rambus Inc . 
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